
Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Jan 26, 2016


Implementing the NIH Data Sharing Policy: Expectations and Challenges. Belinda Seto, Ph.D., Deputy Director, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health (NIH).
Transcript
Page 1: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Belinda Seto, Ph.D.

Deputy Director

National Institute of Biomedical Imaging and Bioengineering

National Institutes of Health (NIH)

Implementing the NIH Data Sharing Policy:

Expectations and Challenges

Page 2: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

NIH Viewpoint

“Data should be made as widely and freely available as possible while safeguarding the privacy of participants, and protecting confidential and proprietary data.”

-- NIH Statement on Sharing Research Data, February 26, 2003

Page 3: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

NIH Data Sharing Policy

NIH expects timely release and sharing of final research data for use by other researchers.

NIH expects grant applicants to include a plan for data sharing or to state why data sharing is not possible, especially if $500K or more of direct cost is requested in any single year.

NIH expects contract offerors to address data sharing regardless of cost.

Effective October 1, 2003

Page 4: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Challenges

Cultural Challenges

– Obtaining data in a traditionally data-sharing-averse environment

– Overcoming the competitive and costly “silo” approach to biomedical research

– Removing barriers to information flow across the complex, heterogeneous environment

Technical Challenges

– Dealing with a lack of interoperable technologies, unifying architectures, standards, and terminologies

– Implementing strategies to process and analyze terabytes of data efficiently

– Maintaining systems in a biologically changing environment

– Securing, protecting, and tracking patient data across disparate systems

Page 5: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Data Sharing Models

NIH serves as a central data repository

A federated model in which grantee institutions provide data repositories

Page 6: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

NIH Central Data Repositories

Genome-wide association study

GenBank

Protein Cluster

PubChem

Many others at: http://www.nlm.nih.gov/databases/

Page 7: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Genome-wide Association Studies (GWAS): Purpose, Goals

To identify common genetic factors that influence health and disease

To study genetic variations, across the entire human genome, that are associated with observable traits

To combine genomic information with clinical and phenotypic data to understand disease mechanisms and predict disease

To develop the knowledge base for personalized medicine

Page 8: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

GWAS Data Sharing Policy

All GWAS-funded investigators are expected to submit to the NIH data repository descriptive information, curated and coded phenotype, exposure, genotype, and pedigree data as soon as quality control procedures are completed at the grantee institutions.

Page 9: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Database of Genotype and Phenotype (dbGP)

Serves as a single point of access to GWAS data

To archive and distribute results from studies of the interaction of genotype and phenotype

Provides pre-competitive data, no IP protection

Encourages use of primary data from dbGP to develop commercial products or tests

Page 10: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Protection of Research Participants: De-Identification

NIH does not possess direct identifiers of research participants and does not have access to the link between the data keycode and identifiable information; such information resides with the grantee institutions

Research institutions submitting a dataset must certify that an IRB and/or Privacy Board has considered and approved the submission

Investigators must strip the data of all identifiers before data submission

Optional: Certificate of Confidentiality
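
The keycode arrangement described on this slide can be illustrated with a minimal Python sketch. It is not NIH's or any grantee's actual procedure: the field names in `DIRECT_IDENTIFIERS` and the function `deidentify` are hypothetical, and a real study would define its own identifier list with its IRB. The sketch only shows the separation of roles, where the coded dataset is submitted and the link table stays at the grantee institution.

```python
import secrets

# Hypothetical list of direct-identifier fields (an assumption for illustration).
DIRECT_IDENTIFIERS = {"name", "address", "phone", "ssn", "medical_record_number"}


def deidentify(records):
    """Split records into a coded dataset and a keycode link table.

    The coded dataset is what would be submitted; the link table is meant
    to remain with the grantee institution and is never submitted.
    """
    coded, link = [], {}
    for record in records:
        keycode = secrets.token_hex(8)  # random code, unrelated to the person
        # Identifiers are kept locally, indexed by keycode.
        link[keycode] = {k: v for k, v in record.items() if k in DIRECT_IDENTIFIERS}
        # Only non-identifying fields plus the keycode go into the submission.
        row = {k: v for k, v in record.items() if k not in DIRECT_IDENTIFIERS}
        row["keycode"] = keycode
        coded.append(row)
    return coded, link
```

Removing direct identifiers in this way does not by itself rule out re-identification from combinations of the remaining fields, which is one reason the submission is also reviewed by an IRB and/or Privacy Board.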


Page 11: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Protection of Research Participants: Informed Consent

NIH expects specific discussion and documentation that participants’ genotype and phenotype data will be shared for research purposes through dbGP

If participants withdraw consent for sharing individual-level genotype and phenotype data, the submitting institution will be responsible for requesting that dbGP remove the data involved from future data distributions.

Page 12: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Data Access

Requesters are expected to meet data security measures: physical security, information technology security, and user training

Requires a signed data use certification in which the requester agrees to:

– Describe the proposed research use of the data

– Follow local laws

– Not sell data elements

– Not share data with individuals not listed in the proposal

– Provide annual progress reports

Page 13: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

dbGP Access: Two Levels

Open-access data includes:

– summaries of studies

– study documents, reports

– measured variables, e.g., phenotypes

– genotype-phenotype analyses

Page 14: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

dbGP: Controlled-Access

Requires varying levels of authorization

Provides data on a per-study basis

Controlled-access data includes:

– De-identified phenotypes and genotypes for individual study subjects

– Pedigrees

– Pre-computed univariate association between genotype and phenotype

Page 15: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Controlled-Access Data Requests

Requester must submit a Data Use Certification

Access is granted by an NIH Data Access Committee

Approval of proposed research use will be consistent with patient consent and data provider’s institutional terms and conditions
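
The request rules on this slide, together with the open and controlled access levels, can be sketched as a simple check. This is an illustration under assumptions, not the actual dbGP authorization logic: the class `AccessRequest`, the function `may_access`, and the idea of representing consent as a set of permitted uses are all hypothetical.

```python
from dataclasses import dataclass

OPEN_ACCESS = "open"
CONTROLLED_ACCESS = "controlled"


@dataclass
class AccessRequest:
    requester: str
    study_id: str
    proposed_use: str
    signed_data_use_certification: bool = False
    approved_by_data_access_committee: bool = False


def may_access(level, request, consented_uses):
    """Return True if the request may see data at the given access level.

    consented_uses is the set of research uses permitted by participant
    consent and the data provider's terms for this study (an assumed model).
    """
    if level == OPEN_ACCESS:
        return True  # study summaries and documents need no authorization
    if level != CONTROLLED_ACCESS:
        raise ValueError(f"unknown access level: {level}")
    return (
        request.signed_data_use_certification
        and request.approved_by_data_access_committee
        and request.proposed_use in consented_uses
    )
```

In this sketch all three conditions must hold for controlled-access data, so a signed certification alone is not sufficient without committee approval and a use consistent with consent.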


Page 16: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Intellectual Property?

Discourages premature claims on pre-competitive information that may impede research

Encourages patenting of technology for downstream product development, e.g.,

– Markers for assays

– Drug targets

– Therapeutics

– Diagnostics

Up to one year of exclusivity is allowed for the primary investigators to submit GWAS data analyses for publication

Clock begins when the GWAS dataset is first made available to the NIH data repository
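
The exclusivity clock can be worked through with a small date calculation. The sketch below assumes the maximum period of one year, interprets "one year" as the same calendar date in the following year, and treats the end date itself as outside the period; the slide does not specify these details, and the function names are hypothetical.

```python
from datetime import date


def exclusivity_end(first_available):
    """Date on which the (assumed) one-year exclusivity period ends."""
    try:
        return first_available.replace(year=first_available.year + 1)
    except ValueError:
        # February 29 has no counterpart in a non-leap year; fall back to the 28th.
        return first_available.replace(year=first_available.year + 1, day=28)


def in_exclusivity_period(first_available, today):
    """True while only the primary investigators may submit analyses."""
    return first_available <= today < exclusivity_end(first_available)
```

For example, a dataset first made available on March 1, 2008 would, under these assumptions, leave the exclusivity period on March 1, 2009.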


Page 17: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

The National Longitudinal Study of Adolescent Health (Add Health):

An Example of Sensitive Data and Multi-Tiered Access

Example of Grantee Institution Providing Access

Page 18: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

The National Longitudinal Study of Adolescent Health (Add Health)

20,745 adolescents enrolled in grades 7-12, followed between 1994 and 2002.

Data from:

– adolescents and parents;

– 90,118 students attending sample schools;

– school administrators;

– independent data on neighborhood/community

Data collected in three waves, 1994 - 2002.

Measures of:

– health

– health-related behaviors (e.g., sex, drugs)

– determinants of health at the individual, family, school, peer group, and community level.

Page 19: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Add Health: Sensitive Data Sharing Example

Challenges to Sharing Data

Data sensitivity

Need to protect confidentiality

Danger of deductive disclosure

Page 20: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Add Health: Sensitive Data Sharing Example

A further challenge…

The timely release of these public use samples is essential. Reviewers understand this to mean that investigators outside of the Carolina Population Center will have ready access to the data as soon as investigators inside the center have such access. Procedures for the guarantee of confidentiality … should apply to all users, both the general public and those at University of North Carolina.

Page 21: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Add Health: Sensitive Data Sharing Example

Solution: a multi-tiered system

Public use data

Contractual data sets

Cold room for on-site data use

Page 22: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Add Health: Sensitive Data Sharing Example

Public use data

Made available through Sociometrics, a small business data archive

Contains only a subset of cases (6,504)

Rare over-samples not included

Contains most data on included cases

Potentially identifying information redacted

Page 23: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Add Health: Sensitive Data Sharing Example

Restricted-use contractual data

Full data set available only under contract

Available to researchers with:

– IRB- and UNC-approved data security plan

– Signed agreement to maintain confidentiality

– Fee covering costs of providing data & user support; monitoring compliance

Requires annual progress report and renewal after 3 years

Page 24: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Add Health: Sensitive Data Sharing Example

Cold room for on-site use

Initial plan required access to some data only on-site at UNC

Cold room constructed at UNC

Limited use to date

Page 25: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Add Health: Sensitive Data Sharing Example

Data security caveats

Security requirements require periodic updating as technology advances

IRBs often lack understanding of security needs

Smaller institutions handicapped in creating secure environments for restricted data

Page 26: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Impact of Sharing Add Health Data

700 publications

1000 conferences

100 dissertations

Page 27: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Challenges: Sharing Image Data

Data acquisition from different vendor machines

Data processing with different software tools

Terabytes of data

Open architecture?

Open access?

Interoperability?

Page 28: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

T2 Weighted Images

Turbo Spin Echo, 3T with fat suppression, Philips

Turbo Spin Echo, 1.5T with fat suppression, GE

Page 29: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

T2 Weighted Images

Single Shot Fast Spin Echo, 3T, Philips

Single Shot Fast Spin Echo, 1.5T, GE

Page 30: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Sharing Data in Databases

Goal: Openly share data in a commonly accepted format

Challenges: need to develop and maintain a database infrastructure that persists beyond the project duration; need for standards for quality control and quality assurance

Page 31: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Use Case: Osteoarthritis Initiative

A public-private partnership:

To improve diagnosis and monitoring of osteoarthritis

To foster development of new treatments

Provide publicly accessible database

Utilize existing infrastructure

Page 32: Belinda Seto, Ph.D. Deputy Director National Institute of Biomedical Imaging and Bioengineering

Budget Consideration

Hardware, software, and storage space

IT support and maintenance

Software tools