Top Banner
iLastic: Linked Data Generation Workflow & User Interface for iMinds Scholarly Data SAVE-SD 2017 Anastasia Dimou , Gerald Haesendonck, Martin Vanbrabant, Laurens De Vocht, Ruben Verborgh, Steven Latré, Erik Mannens [email protected] @natadimou Ghent University – IDLab – imec
53

iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Jan 29, 2018

Download

Technology

andimou
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic:Linked Data Generation

Workflow & User Interface for iMinds Scholarly Data

SAVE-SD 2017

Anastasia Dimou, Gerald Haesendonck, Martin Vanbrabant, Laurens De Vocht, Ruben Verborgh, Steven Latré, Erik Mannens

[email protected] ● @natadimouGhent University – IDLab – imec

Page 2: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Publication is archived & published by the

event organizers where it was presentedpublisher who publishes the proceedingsauthors who co-edited itorganization(s) the authors are affiliated with

Page 3: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Publication

Dimou A. et al. (2015) Assessing & Refining Mappings to RDF to Improve Dataset Quality In: Arenas M. et al. (eds) The Semantic Web - ISWC 2015 Lecture Notes in Computer Science, vol 9367. Springer, Cham

Page 4: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Publication is archived & published by the

event ISWC2015http://iswc2015.semanticweb.org/sites/iswc2015.semanticweb.org/files/93670111.pdf

Page 5: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Publication is archived & published by the

event ISWC2015publisher LNCS, Springer

https://link.springer.com/chapter/10.1007/978-3-319-25010-6_8

Page 6: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Publication is archived & published by the

event ISWC2015publisher LNCS, Springerauthors multiple by 8

https://ruben.verborgh.org/publications/dimou_iswc_2015a/http://jens-lehmann.org/files/2015/iswc_rml_rdfunit.pdf

Page 7: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Publication is archived & published by the

event ISWC2015publisher LNCS, Springerauthors multiple by 8organization(s) multiple by 5

https://biblio.ugent.be/publication/8030828

Page 8: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Publication is archived & published 15 times!!

Dimou A. et al. (2015) Assessing & Refining Mappings to RDF to Improve Dataset QualityIn: Arenas M. et al. (eds) The Semantic Web - ISWC 2015Lecture Notes in Computer Science, vol 9367. Springer, Cham

Page 9: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Publication is published 15 times...

… if all agents publish its scholarly data as Linked (Open) Data

Page 10: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Publication is published N times...

… if N agents publish its scholarly data as Linked (Open) Data

Linked (Open) Data is generated with N different ways

Page 11: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Semantic Publishing

enhances the meaning of publications by enriching them with metadata

Page 12: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Semantic Publishing: ad-hoc solutions

different agents ownoverlapping or complementary scholarly data

use their own ad-hoc solutionsto generate and publish their own Linked (Open) Data

Page 13: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Semantic Publishing: fragmented datasets

different agents ownoverlapping or complementary scholarly data

focus on metadata or content, rarely on both

content annotations are rarely published as datasets

Page 14: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Semantic Publishing: currently leading to..

duplicate efforts for Linked (Open) Data generation:

(re-)implementing from scratch

non-negligible implementation & maintenance costs

Page 15: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Semantic Publishing: current

effort for Linked (Open) Data generation:

implementation & maintenance ↗

Page 16: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Semantic Publishing: our approach

effort for Linked (Open) Data generation:

implementation & maintenance ↘

model, semantic annotations, integration & cleansing ↗

Page 17: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

How can we reduce implementation costsincrease Linked Data quality?

Page 18: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Semantic Publishing: our approach

general-purpose Linked (Open) Data generation and publication workflow

adjusted to each agent’s scholarly data

integrates metadata & content annotations

Page 19: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Semantic Publishing: iLastic

general-purpose Linked (Open) Data generation and publication workflowbased on our modular RML tool chain

adjusted to iMinds & Ghent university repositoryoverlapping and complementary scholarly data

integrates metadata & content annotationsbased on the RML tool chain & text enricher alignment

Page 20: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication service

Enrichment service

Page 21: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data
Page 22: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data
Page 23: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication service

Enrichment service

Page 24: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication service

Enrichment service

Page 25: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicegeneral purpose tool: distinct mapping rules definition & execution

Enrichment service

Page 26: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

Mapping Module

Processor

Extraction Module

mapping rules

Page 27: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicegeneral purpose tool:distinct mapping rules definition & executionexecution: RML Processor

Enrichment service

https://github.com/RMLio/RML-Processor

Page 28: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data
Page 29: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicegeneral purpose tool:distinct mapping rules definition & executionexecution: RML Processordefinition

Enrichment service

Page 30: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicegeneral purpose tool:distinct mapping rules definition & executionexecution: RML Processordefinition: RML language

Enrichment service

A. Dimou et al. (2014) RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data. In Proceedings of the 7th Workshop on Linked Data on the Web (LDOW2014), Seoul, Korea.http://rml.io

Page 31: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicegeneral purpose tool:distinct mapping rules definition & executionexecution: RML Processordefinition: RML Editor

Enrichment service

Heyvaert P. et al. (2016) RMLEditor: A Graph-Based Mapping Editor for Linked Data Mappings. In The Semantic Web. Latest Advances and New Domains. ESWC 2016. LNCS, vol 9678. Springer, Chamhttps://www.youtube.com/watch?v=0lPDaghlZoQ

Page 32: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data
Page 33: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicegeneral purpose tool:execution: RML Processordefinition: RML Editorvalidation

Enrichment service

Page 34: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicegeneral purpose tool:execution: RML Processordefinition: RML Editorvalidation: RML Validator

Enrichment service

Dimou A. et al. (2015) Assessing and Refining Mappingsto RDF to Improve Dataset Quality. In: Arenas M. et al. (eds) The Semantic Web - ISWC 2015. Lecture Notes in Computer Science, vol 9367. Springer, Cham

Page 35: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data
Page 36: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication service

Enrichment service

Page 37: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication service

Enrichment service

Page 38: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data
Page 39: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication service

Enrichment servicePDF Extraction: CERMINE

http://cermine.ceon.pl/

Page 40: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data
Page 41: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication service

Enrichment servicePDF Extraction: CERMINENER: DBpedia Spotlight

https://github.com/dbpedia-spotlight/dbpedia-spotlight

Page 42: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data
Page 43: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication service

Enrichment service

Page 44: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Dataset

59,462 entities12,472 researchers22,728 publications81 organizations3,295 projects765,603 triples

Page 45: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicedata dumpsLinked Data Fragments

Enrichment service

http://linkeddatafragments.org/

Page 46: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicedata dumpsLinked Data FragmentsSPARQL endpoint - Virtuoso

Enrichment service

https://github.com/openlink/virtuoso-opensource

Page 47: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data
Page 48: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic Workflow

RDF generation & publication servicedata dumpsLinked Data FragmentsSPARQL endpoint - VirtuosoThe DataTank

Enrichment service

http://thedatatank.com/

Page 49: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic User Interface

Page 50: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic User Interface

Page 51: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic User Interface

Page 52: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic User Interface

https://www.youtube.com/watch?v=ZxGrHnOuSvw

Page 53: iLastic: Linked Data Generation Workflow and User Interface for iMinds Scholarly Data

iLastic:Linked Data Generation

Workflow & User Interface for iMinds Scholarly Data

[email protected] ● @natadimou