Quality views: capturing and exploiting the user perspective on data quality Paolo Missier, Suzanne Embury, Mark Greenwood School of Computer Science University of Manchester, UK Alun Preece, Binling Jin Department of Computing Science University of Aberdeen, UK http://www.qurator.org
18
Embed
Quality views: capturing and exploiting the user perspective on data quality Paolo Missier, Suzanne Embury, Mark Greenwood School of Computer Science University.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Quality views: capturing and exploiting the user perspective on data quality
Paolo Missier, Suzanne Embury, Mark GreenwoodSchool of Computer ScienceUniversity of Manchester, UK
Alun Preece, Binling JinDepartment of Computing Science
University of Aberdeen, UK
http://www.qurator.org
Combining the strengths of UMIST andThe Victoria University of Manchester
Integration of public data (in biology)
GenBankUniProt
EnsEMBL
Entrez
dbSNP
• Large volumes of data in many public repositories• Increasingly creative uses for this data• Their quality is largely unknown
Combining the strengths of UMIST andThe Victoria University of Manchester
Quality of e-science data
Defining quality can be challenging:
• In-silico experiments express cutting-edge research
– Experimental data liable to change rapidly
– Definitions of quality are themselves experimental
• Scientists’ quality requirements often just a hunch
– Quality tests missing or based on experimental heuristics
– Often implicit and embedded in the experiment not reusable
Criteria for data acceptability within a specific data processing context
Criteria for data acceptability within a specific data processing context
A data consumer’s view on quality:
Combining the strengths of UMIST andThe Victoria University of Manchester