Personal Data Management • Why is this such an issue? Data Provenance • Representing links v Representing data • Identifying resources: Life Science Identifiers • Different types of provenance • Provenance generation • Provenance storage • Provenance retrieval
13
Embed
Personal Data Management Why is this such an issue? Data Provenance Representing links v Representing data Identifying resources: Life Science Identifiers.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Personal Data Management• Why is this such an issue? Data Provenance
• Representing links v Representing data• Identifying resources: Life Science Identifiers
• Different types of provenance
• Provenance generation
• Provenance storage
• Provenance retrieval
Problem
• Automated workflows produce lots of heterogeneous data
• These are just some of the results from one workflow run for Williams Disease
Amplification of results
One input
Many outputs
Link v Data Representation
• Data management questions refer to relationships rather than internal content– What are the origins of this data?
• Which service produced this data?• Which data is this derived from?• Who was this data produced for?• ?What is this data telling me?
• Data analysis questions delegated to external services.
Representing links
• Identify each resource– Life science identifier: URI with associated data and
metadata retrieval protocols.– Understanding that underlying data will not change