Top Banner
Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing Department STFC Rutherford Appleton Laboratory [email protected]
21

Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Dec 24, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Scientists are Sensitive too:

Some Issues in Research ethics arising from Data Sharing  

Brian Matthews

Scientific Information GroupScientific Computing Department

STFC Rutherford Appleton Laboratory

[email protected]

Page 2: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

What’s this talk about?

• Cultural barriers to sharing data– Ethics of unrestricted access

• How this affects what we do– Data Policy– Implementation of Data Publication

• Stimulate discussion

Page 3: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

“We must give taxpayers more bang for their buck. Open access to papers and data will speed up important breakthroughs by our researchers and businesses, boosting knowledge and competitiveness in Europe.”

Máire Geoghegan-Quinn,

European commissioner for research, innovation and science

July 2012

http://europa.eu/rapid/press-release_IP-12-790_en.htm?locale=en

.

Page 4: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Opportunities for Data Exchange (ODE)

• EC FP7 Project: 2010-12• Workshops and interviews

– Conceptual model– Drivers, barriers, enablers to data

sharing

R. Darby, S. Lambert, B. Matthews, M. Wilson, K. Gitmans, S. Dallmeier-Tiessen, S. Mele, J. Suhonen Enabling Scientific Data Sharing and Re-use. IEEE Conf. on E-Science, Chicago, Oct 2012.

http://www.alliancepermanentaccess.org/index.php/community/current-projects/ode/

Page 5: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Drivers for Data Sharing• Societal benefits

– Economic/commercial benefits; – Better quality decision making in government and commerce;

• Academic benefits– The integrity of science as an activity is increased

• Research benefits– For the data contributor:

• Validation of scientific results by other scientists; • Recognition of their contribution.

– For the data user: • Re-Use of data in new analysis• Re-use of data in metastudies ;• Re-use of data in interdisciplinary studies;

• Organisational benefits– Producer Organisation: enhances organizational profile; – Publisher Organisation: adds value to the product.– Infrastructure Organisation: reputation as "data holder with expert

support" is increased – Consumer Organisation: can use data to make policy decisions;

Page 6: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Cultural Barriers to Data Sharing

• Publisher Practises:– Journal articles do not describe available data as a publication– Data not recognised as a citable publication– Lack of data reviewers to assess data quality

• Research Assessment– Publication and citation of data not tracked– Not counted as part of performance evaluation for careers

• Academic Defensiveness– Fear that others will benefit from their data and gain priority for results– Fear that their results will not be validated– Fear that misuse of data will harm the data contributor– Fear that use of data to support arguments the data contributor disagrees with

• Personal data confidentiality– Anonymity of subjects in medical and social science in particular– Perceived conflicts between data protection and FOI

Thus unrestricted data access has ethical implications

Page 7: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

“By confusing the allocation of scientific merit and potentially undermining authorship conventions, data sharing could work against individual scientists' need for recognition”

Gerrit Hirschfeld, Open science: Data sharing is harder to reward

Correspondence, Nature Volume: 487, Page: 302 : (19 July 2012) DOI: doi:10.1038/487302c

Page 8: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

RCUK Principles on Data Policy

Common Principles1. Public good 2. Preservation3. Discoverability4. Confidentiality 5. First use6. Recognition7. Costs

A tension between these principles

http://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx

Publicly funded research data are a public good, produced in the public interest,

which should be made openly available with as few restrictions as possible in a

timely and responsible manner that does not harm intellectual property.

RCUK recognises that there are legal, ethical and commercial constraints on

release of research data. To ensure that the research process is not damaged by inappropriate release of data, research

organisation policies and practices should ensure that these are considered at all

stages in the research process

Page 9: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

So how to do we implement a data management policy and infrastructure which acknowledges and manages these tensions ?

Page 10: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.
Page 11: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

• ~30,000 user visitors each year in Europe: – physics, chemistry, biology, medicine,

energy, environmental, materials, culture, pharmaceuticals, petrochemicals, microelectronics

• Billions of € of investment– c. £400M for DLS + running costs

• Over 5.000 high impact publications per year in Europe

The Science We Do: Large Scale Facilities

Fitting experimental data to model

Bioactive glass for bone growth

Structure of cholesterol in crude oil

Hydrogen storage for zero emission vehicles

Magnetic moments in electronic storage

Longitudinal strain in aircraft wing

Diffraction pattern from sample

Visit facility on research campus

Place sample in beam

• Data management infrastructure– Capture,– Process– Store– Catalogue

• Link to publications• Common infrastructure in Europe

Page 12: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

A Facility Data Repository• A bit like a University research repository

– Data generated from within our institution– by nature collaborative with other institutions

• A bit like a Subject Repository– Data collected via “Neutron or Synchrotron” science– not, one discipline, nor the whole of any discipline– Do not have a mandate to aggregate and disseminate data within

the discipline

• A bit like a Memory institution– Public funding to collect, preserve and make available data– No obligation to deposit/mandate to archive

Supporting our user community

Page 13: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

ISIS Data Policy • ISIS Neutron Source Data Policy

http://www.isis.stfc.ac.uk/user-office/data-policy11204.html

– Consultation with science user community

• Influencing Diamond Synchrotron policy• Now influencing policy across Europe

– PaNData data policy framework– For similar facilities http://wiki.pan-data.eu/imagesGHD/0/08/PaN-data-D2-1.pdf

Page 14: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Some policy details3.1.1 All raw data and the associated metadata obtained as a result of free (non-commercial)

access to ISIS, reside in the public domain, with ISIS acting as the custodian

3.3.2 Access to the on-line catalogue will be restricted to those who register with STFC/ISIS as users of the on-line catalogue.

3.3.3 Access to raw data and the associated metadata obtained from an experiment is restricted to the experimental team for a period of three years after the end of the experiment. Thereafter, it will become publicly accessible. Any PI that wishes their data to remain ‘restricted access’ for a longer period will be required to make a special case to the Director of ISIS.

3.3.6 The on-line catalogue will enable the linking of experimental data to experimental proposals.  Access to proposals will only ever be provided to the experimental team and appropriate STFC staff, unless otherwise authorized by the PI. 

4.1.1 Ownership of all results derived from the analysis of the raw data is determined by the contractual obligations of the person(s) performing the analysis.

5.4 PIs and researchers who carry out analyses of raw data and metadata are encouraged to link the results of these analyses with the raw data / metadata using the facilities provided by the on-line catalogue.  Furthermore, they are encouraged to make such results publicly accessible.

Page 15: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

TopCat

Page 16: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Data Publication using DOIs• Use DataCite service to:

– mint, sustain, search and discover digital object identifiers (DOI)

• DOIs are issued per experiment– experiments collecting raw data – Easy for facility– May also want finer granularity (datasets, datafiles)

• Makes data a research publication– Data is citable like a journal article– Can be accessed and quality assured– Bibliometric services count data citation frequency for impact, e.g. REF

submission

• Experimenters can gain credit for collecting data

Page 17: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Easton,S; Barnes,C H W; Ionescu,A; (2011): RB820232: Magnetic moment of EuO in spin filtering magnetic tunnel structures.; STFC ISIS Facility. doi:10.5286/ISIS.E.24066298

DOI Data Access Process

Page 18: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Issues of using DOIs • The DOI is issued when the experimental time is allocated.

– we want to identify the experiment itself – encourage use of the DOI

• DataCite require: authors, title, date, publisher to be entered when the DOI is issued. – But this can give a information away too early

• before expiration of embargo period.• before publication of results • before derivation of results

– Even releasing Metadata could be unethical (or at least break our poiicy)

• Should the DOI be issued later so that this information is only made public later ? – When data is collected ?– When data is released ?

• The minimum metadata are available from the registry when data are collected:– Update the metadata when the data is released.

Page 19: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.
Page 20: Scientists are Sensitive too: Some Issues in Research ethics arising from Data Sharing Brian Matthews Scientific Information Group Scientific Computing.

Summary• Open data is a public policy goal

– But need to bring along the research community– Ethical implications of unrestricted release– Data release should not damage the research process.

• Data publication– Provide data publication and citation mechanisms

• Data embargos– Exclusive access to research data – as a data collector– Embargo the metadata too– A crude one size fits all mechanism

• Recognition and rewards for data publication– a common system of credit and recognition for data production and sharing is

needed– provide researchers with clear instructions on how to cite data– Include data publication & citation metrics in researcher appraisal