Top Banner
deepcarbon.net Xiaogang Ma, Patrick West , John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer Polytechnic Institute From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies
35

Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

Jan 17, 2016

Download

Documents

Lesley Evans
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

deepcarbon.net

Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox

Tetherless World ConstellationRensselaer Polytechnic Institute

From data portal to knowledge portal:Leveraging semantic technologies to support interdisciplinary studies

Page 2: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

2

Outline

• Deep Carbon Observatory

• Deep Carbon Virtual Observatory (DCvO)

– Architecture of DCvO

– DCO Ontologies

– Boundary activities

– Discovering information by clicking through

• Summary

Page 3: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

3

A 10-year (2009-2019) initiative to intensify global attention and scientific effort in the burgeoning field of deep carbon science

Page 4: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

4

• Faculty, staff and students from the Tetherless World Constellation (TWC) at Rensselaer Polytechnic Institute (RPI)

• Responsible for– DCO Architecture and technology infrastructure– DCO Computer Cluster– The Deep Carbon Virtual Observatory DCvO

Deep Carbon Observatory – Data Science

Page 5: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

5

Deep Carbon Virtual Observatory

Scientists – actually ANYONE - should be able to access a global, distributed knowledge base of scientific data and information that:• appears to be integrated• appears to be locally available • is in a language (written, programming, or science)

that is understandable and can be shared

Data intensive – volume, complexity, mode, scale, heterogeneity, … in an OPEN WORLD

Page 6: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

6

Deep Carbon Virtual Observatory

• A vision of the DCvO:– A conceptual model of the interplay between data, people,

publication, instruments, models, organizations, etc.– Identify, annotate and link all key entities, agents and activities – A repository for datasets and associated metadata– Unique and powerful data and metadata visualization for

dissemination of information– Facilitates the discovery of potential collaborations– An integrated portal for diverse content and applications

(Fox et al., 2014)

Page 7: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

7

DCvO “Architecture”

Page 8: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

8

vivo.cornell.edu

VIVO - represents academic research

communities

DCO ontology: a model for concept types and relationships

DCO ontologies extend each other and the VIVO ontology

Page 9: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

9

Ontologies and schemas used in the DCO web portal

Name Prefix

Dublin Core Metadata Element Set dc

DCMI Metadata Terms dct

VIVO Core vivo

VIVO Scientific Research Ontology scires

Data Catalog Vocabulary dcat

Bibliographic Ontology bibo

Citation Counting and Context Characterization Ontology c4o

Citation Typing Ontology cito

FRBR-Aligned Bibliographic Ontology fabio

Event Ontology event

Friend of a Friend foaf

vCard Ontology vcard

Geopolitical Ontology geo

Simple Knowledge Organization System skos

DCO Ontology dco

PROV Ontology prov

Page 10: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

10

Ontologies and schemas used in the DCO web portal

DCO Boundary Activities are driving the extensions within the DCO Ontologies

Page 11: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

11

DCO Extension for Project Updates

Page 12: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

12

Dynamically generated list of Grants that are part of the Deep Carbon

Observatory. Users can click through to learn more, and members can create

reports to be sent to funding orgs

Page 13: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

13

Grant page lists all projects and reporting updates for each of the

projects and field studies

Page 14: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

14

DCO Extension for Data Types

Page 15: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

15

A Few Boundary Activities

• Given a DOI pull publication information from CrossRef

and/or Web of Science

• DCO IGSN Allocation Agent to work with the IGSN

Registry

• Integration with existing data portals and repositories

• Data Rescue activities

Page 16: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

16

Modern informatics enables a new scale-free framework approach

• Use cases• Stakeholders• Modeling• Ontologies• Evaluation

Page 17: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

17

What does a DCO data publication look like?

Page 18: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

18

Identification and annotation

Information on the landing page of a dataset

Page 19: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

19

Linking to enable forward and backward tracking

Landing page of Helium Concept

Page 20: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

20

Landing page of a person

Linking to build Collaborations

Page 21: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

21

Landing page of a research area

Linking to build Collaborations

Page 22: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

22

DCO Knowledge Graph Analytics

Page 23: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

23

Thus… progress…

• Integrative – semantics• Transparent – semantics• Collaborative – semantics• Application integration

– Yep – semantics

Page 24: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

24

Thank you!Patrick West, [email protected], https://deepcarbon.net, http://tw.rpi.edu

Page 25: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

25

An integrated portal: deepcarbon.net

Page 26: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

26

Faceted publication

browser

Page 27: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

27

Repository for archiving datasets

Archived datasets of ‘Noble gas isotope abundances in

terrestrial fluids’

Page 28: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

28

Collaboration tools

Group Based CollaborationGroup data deposit and

reporting

Listings of group content

Group management

and messaging

Page 29: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

29

RDA DTR and PIT adoption

The DTR primitives are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, e.g. Dataset, Image, Video, Audio, etc.

A registered DCO dataset is asserted as an instance of one of those basic data type classes.

It is possible to further annotate the dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID.

A Few Boundary Activities

Page 30: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

Results of data type specification

• Updates to the DCO Ontology:– A new class dco:DataType. Each specific data type is an instance of it– An object property dco:hasDataType linking a dataset and a data type– A collection of other classes and properties associated with dco:DataType

30

Page 31: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

31

• New datasets available via dataset browser• Includes citations to the originating publication• Data files accessible through dataset repository

Thermodynamic Data Rescue

Page 32: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

32

DCO Knowledge Store Analytics

Page 33: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

33

DCO Knowledge Store Visualizations

Page 34: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

34

All information is linked and traceable!

Page 35: Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.

35

Mediation

From: C. Borgman, 2008, NSF Cyberlearning Report, Illustration by Roy Pea and Jillian C. Wallis

Guess

6th Generation

All these generations of mediation are in effect as we collaborate