
2011 07 keynote-ktw

May 08, 2015

Johannes Keizer

Keynote at the Knowledge Technology Week in Kuala Lumpur
Transcript
Page 1: 2011 07 keynote-ktw

Building the CIARD Framework for Data and Information Sharing
Praha, July 12 - Johannes Keizer

Keynote at Knowledge Technology Week
Kuala Lumpur, 2011, July 20

Dr. Johannes Keizer
Office of Knowledge Exchange, Research and Extension
Food and Agriculture Organization of the UN

CIARD - creating a global framework for information sharing in agricultural research and innovation

Page 2: 2011 07 keynote-ktw

“... FAO’s principal task is to work to ensure that the world’s knowledge of food and agriculture is available to those who need it when they need it and in a form which they can access and use ...”

Page 3: 2011 07 keynote-ktw

More scientific data will be generated in the next 5 years than in the entire history of humankind

Page 4: 2011 07 keynote-ktw

Contribution and Participation in Science

Territory size shows proportion of scientific papers published in 2001 by authors living there. Copyright SASI Group (University of Sheffield) and Mark Newman (University of Michigan)

Page 5: 2011 07 keynote-ktw

Page 6: 2011 07 keynote-ktw

Page 7: 2011 07 keynote-ktw

Page 8: 2011 07 keynote-ktw

The Internet!

Page 9: 2011 07 keynote-ktw

Aggregation States of Knowledge

Page 10: 2011 07 keynote-ktw

Data and Information in Agricultural Research and Extension

Page 11: 2011 07 keynote-ktw

Distributed Repositories

• stats
• gene banks
• GIS data
• blogs
• journals
• open archives
• raw data
• technologies
• learning objects
• ...

Page 12: 2011 07 keynote-ktw

Task 1: making services

? ? ?

Page 13: 2011 07 keynote-ktw

Task 2: getting knowledge

? ? ?

Page 14: 2011 07 keynote-ktw

Task 3: working together

? ? ?

How can I get, in real time and on my desktop, all the specimen data on useful insects from everyone doing research on them? How can I share my own data in real time with other colleagues working on the same topic?

Page 15: 2011 07 keynote-ktw

http://www.ciard.net

Page 16: 2011 07 keynote-ktw

Page 17: 2011 07 keynote-ktw

The Project: agINFRA

• Enforce web publishing of data
• Produce linked open data from all datasets
• Use common reference vocabularies to interlink data sets
• Don’t wait! Wrap the legacy
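As a rough illustration of "wrapping the legacy", the sketch below turns one record from an existing database into N-Triples lines that point at a common vocabulary URI. The record, its field names (`title`, `agrovoc_ids`) and the `example.org` record URI are hypothetical; only the Dublin Core Terms and AGROVOC namespaces come from material shown elsewhere in this deck.

```python
# Sketch only: the legacy record, its field names and the record URI are hypothetical.
DCT = "http://purl.org/dc/terms/"
AGROVOC = "http://aims.fao.org/aos/agrovoc/"


def escape_literal(text):
    """Escape backslashes, double quotes and newlines for an N-Triples literal."""
    return (text.replace("\\", "\\\\")
                .replace('"', '\\"')
                .replace("\n", "\\n"))


def wrap_record(uri, record):
    """Turn one legacy record (a dict) into a list of N-Triples lines."""
    lines = []
    if "title" in record:
        lines.append('<%s> <%stitle> "%s" .'
                     % (uri, DCT, escape_literal(record["title"])))
    # Subjects are expressed as links to vocabulary URIs, not as free-text strings.
    for concept_id in record.get("agrovoc_ids", []):
        lines.append("<%s> <%ssubject> <%s%s> ."
                     % (uri, DCT, AGROVOC, concept_id))
    return lines


legacy = {"title": 'Legacy report on "feed"', "agrovoc_ids": ["c_7825"]}
triples = wrap_record("http://example.org/record/1", legacy)
for line in triples:
    print(line)
```

The point of the design is that the legacy store stays untouched: the wrapper only reads records and emits triples, so publishing linked data does not have to wait for a migration.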

Page 18: 2011 07 keynote-ktw

The Infrastructure elements

• RING: routemap to information nodes and gateways
• Tools: LOD-enabled software
• VocBench: vocabulary server; concepts and entities triples
• LOD Generator: triplifier, concept and entity identifier
• Data Services: Webservices + APIs to triple stores
• Cloud: storage for RDF triples

Page 19: 2011 07 keynote-ktw

LOD Generator: process

LOD Generator: triplifier, concept and entity identifier

Page 20: 2011 07 keynote-ktw

Data Services: process

Page 21: 2011 07 keynote-ktw

Under Construction !!!!!

VocBench

AGROVOC Linked Open Data

AgroTagger

Triplifying AGRIS

Serendipity linking

Drupal front ends for triple stores

The CIARD R.I.N.G

Page 22: 2011 07 keynote-ktw

The Infrastructure elements (overview diagram, as on Page 18)

Page 23: 2011 07 keynote-ktw

The VocBench

VocBench: concepts and entities triples

Page 24: 2011 07 keynote-ktw

VocBench Features

Domain independent

Structure independent (e.g. thesauri, glossaries, etc.)

Supports RDF (SKOS, SKOS-XL), OWL

Supports collaborative editing

Supports editorial workflow, with user roles

Simple and advanced search

Supports data export: SKOS, Relational format (MySQL)

Page 25: 2011 07 keynote-ktw

Page 26: 2011 07 keynote-ktw

The AGROVOC concept scheme

[Diagram: concepts 12332, 1474, 8171 and 6211, typed as SKOS Concept via rdf:type, are connected by skos:broader and attached to the AGROVOC Concept Scheme through skos:inScheme and skos:topConceptOf. SKOS Label nodes :foo (skosxl:literalForm “maize”, has_translation maïs (fr)) and :bar (skosxl:literalForm “corn”) are attached through skosxl:prefLabel and skosxl:altLabel and related to each other by has_synonym. Further schemes in FAO (“Another scheme in FAO”, “Other scheme in FAO”) appear alongside, linked through skos:inScheme.]
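The label model in this diagram can be sketched with plain Python tuples. In SKOS-XL a label is a resource in its own right, so a concept points to a label node and the label node carries the text. The label URIs (`xl_en_maize`, `xl_en_corn`) and the choice of "maize" as the preferred label are assumptions made for illustration; the concept number 12332 and the two strings are taken from the slide.

```python
# Sketch only: label URIs and the preferred/alternative assignment are assumptions.
AGROVOC = "http://aims.fao.org/aos/agrovoc/"
SKOSXL = "http://www.w3.org/2008/05/skos-xl#"

concept = AGROVOC + "c_12332"
label_maize = AGROVOC + "xl_en_maize"  # hypothetical label URI
label_corn = AGROVOC + "xl_en_corn"    # hypothetical label URI

# Each triple is (subject, predicate, object); literals are (text, language) pairs.
triples = [
    (concept, SKOSXL + "prefLabel", label_maize),
    (concept, SKOSXL + "altLabel", label_corn),
    (label_maize, SKOSXL + "literalForm", ("maize", "en")),
    (label_corn, SKOSXL + "literalForm", ("corn", "en")),
]


def objects(subject, predicate):
    """Return all objects of triples matching the given subject and predicate."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def label_texts(concept_uri, label_property):
    """Follow concept -> label node -> literal form and return the label strings."""
    texts = []
    for label in objects(concept_uri, SKOSXL + label_property):
        texts.extend(text for text, _lang in objects(label, SKOSXL + "literalForm"))
    return texts


print(label_texts(concept, "prefLabel"))
print(label_texts(concept, "altLabel"))
```

Because each label has its own URI, relations such as has_synonym or has_translation can be stated between labels, which plain SKOS string labels do not allow.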

Page 27: 2011 07 keynote-ktw

Page 28: 2011 07 keynote-ktw

Page 29: 2011 07 keynote-ktw

The Infrastructure elements (overview diagram, as on Page 18)

Page 30: 2011 07 keynote-ktw

AgroTagger

• Identifies concepts in unstructured texts

• Uses AGROVOC as a controlled vocabulary

• Prototype under testing with excellent results (entire repository of ICARDA indexed)

• In future, will produce structured RDF files that can be used to link data, like “OpenCalais”
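The idea of tagging free text against a controlled vocabulary can be illustrated with a minimal dictionary lookup. This is not AgroTagger's actual algorithm, which the slides do not describe; the two-entry vocabulary and its pairing of "maize" and "corn" with concept c_12332 are assumptions based on the maize/corn example in this deck.

```python
import re

# Hypothetical excerpt of a controlled vocabulary: lower-cased label -> concept URI.
# A real tagger would load the full AGROVOC label set instead.
VOCABULARY = {
    "maize": "http://aims.fao.org/aos/agrovoc/c_12332",
    "corn": "http://aims.fao.org/aos/agrovoc/c_12332",
}


def tag_text(text, vocabulary):
    """Return the sorted concept URIs whose labels occur as whole words in text."""
    found = set()
    lowered = text.lower()
    for label, uri in vocabulary.items():
        # \b keeps "corn" from matching inside words such as "popcorn".
        if re.search(r"\b" + re.escape(label) + r"\b", lowered):
            found.add(uri)
    return sorted(found)


tags = tag_text("Corn yields rose, but MAIZE prices fell.", VOCABULARY)
print(tags)
```

Both surface forms resolve to the same URI, so a document is tagged with a concept rather than with a string, which is what makes the tags usable for linking.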

Page 31: 2011 07 keynote-ktw

Page 32: 2011 07 keynote-ktw

Page 33: 2011 07 keynote-ktw

AGRIS Journal disambiguation

• 2,644,818 AGRIS records

• 2,171,113 of these are journal records (82.09%)

• 1,788,083 journal records have been covered by the disambiguation process (82.35%)

• 14,658 journals have been correctly disambiguated

• ~20,000 strings that refer to journal titles still have to be examined

• Triples have been generated (an example follows on Page 34).
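One simple way to approach the disambiguation summarised on this slide is to normalise each title string and group the variants that collapse to the same key. The slides do not say how the AGRIS process works, so this is only a sketch under that assumption; the variant list is illustrative, adapted from the title forms in the "Circular técnica" record.

```python
import unicodedata
from collections import defaultdict


def normalize_title(title):
    """Lower-case, strip accents and punctuation, and collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", title)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    cleaned = "".join(ch if ch.isalnum() else " " for ch in no_accents.lower())
    return " ".join(cleaned.split())


def group_by_title(strings):
    """Group raw title strings under their normalised form."""
    groups = defaultdict(list)
    for raw in strings:
        groups[normalize_title(raw)].append(raw)
    return dict(groups)


# Illustrative variants (not actual AGRIS data).
variants = ["Circular técnica", "Circular Tecnica", "CIRCULAR TÉCNICA.", "Circ. téc."]
groups = group_by_title(variants)
for key, members in groups.items():
    print(key, members)
```

Normalisation alone leaves the abbreviated form "Circ. téc." in a group of its own, which suggests why a residue of strings still needs manual examination or extra evidence such as a shared ISSN.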

Page 34: 2011 07 keynote-ktw

Triplifying AGRIS (small example)

<?xml version="1.0" encoding="utf-8"?>
<rdf:RDF xmlns:ags="http://purl.org/agmes/1.1/"
         xmlns:foaf="http://xmlns.com/foaf/0.1/"
         xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
         xmlns:bibo="http://purl.org/ontology/bibo/"
         xmlns:dc="http://purl.org/dc/elements/1.1/"
         xmlns:dct="http://purl.org/dc/terms/">
  <bibo:Journal rdf:about="http://aims.fao.org/aos/journal/c_b6e4ca85">
    <bibo:ISSN>0101-9066</bibo:ISSN>
    <bibo:ISSN>0101-9066</bibo:ISSN>
    <dct:title><![CDATA[Circular técnica]]></dct:title>
    <dct:alternative><![CDATA[Circular técnica (Centro Nacional de Pesquisa de Seringueira e Dendê)]]></dct:alternative>
    <dct:alternative><![CDATA[Circular Tecnica - Centro Nacional de Pesquisa da Seringueira e Dende]]></dct:alternative>
    <dct:alternative><![CDATA[Circular técnica - CNPSD]]></dct:alternative>
    <dct:alternative><![CDATA[Circ. téc.]]></dct:alternative>
    <ags:publisherPlace rdf:resource="http://aims.fao.org/aos/geopolitical.owl#Brazil"/>
    <dct:publisher><![CDATA[Empresa Brasileira de Pesquisa Agropecuária, Centro Nacional de Pesquisa de Seringueira e Dendê]]></dct:publisher>
    <dct:language>por</dct:language>
    <dct:date>1980</dct:date>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_10795"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_4650"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_32372"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_332"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_3589"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_5556"/>
  </bibo:Journal>
</rdf:RDF>
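To show how a record like this reads as triples, the sketch below parses an abridged copy with Python's standard `xml.etree.ElementTree` and lists one (subject, predicate, object) tuple per child element. The abridged record is wrapped in a closed `rdf:RDF` element so that it parses; the function name `journal_triples` is illustrative, and a production pipeline would more likely use a full RDF library.

```python
import xml.etree.ElementTree as ET

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "bibo": "http://purl.org/ontology/bibo/",
    "dct": "http://purl.org/dc/terms/",
}
RDF_ABOUT = "{%s}about" % NS["rdf"]
RDF_RESOURCE = "{%s}resource" % NS["rdf"]

# Abridged copy of the journal record (fewer titles and subjects).
RECORD = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:bibo="http://purl.org/ontology/bibo/"
         xmlns:dct="http://purl.org/dc/terms/">
  <bibo:Journal rdf:about="http://aims.fao.org/aos/journal/c_b6e4ca85">
    <bibo:ISSN>0101-9066</bibo:ISSN>
    <dct:title><![CDATA[Circular técnica]]></dct:title>
    <dct:alternative><![CDATA[Circ. téc.]]></dct:alternative>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_10795"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_4650"/>
  </bibo:Journal>
</rdf:RDF>"""


def journal_triples(xml_text):
    """Return (subject, predicate, object) tuples for each bibo:Journal element."""
    root = ET.fromstring(xml_text)
    result = []
    for journal in root.findall("bibo:Journal", NS):
        subject = journal.get(RDF_ABOUT)
        for child in journal:
            # ElementTree tags look like "{namespace}local"; join them into one URI.
            predicate = child.tag.replace("{", "").replace("}", "")
            # An rdf:resource attribute is a link; otherwise use the literal text.
            obj = child.get(RDF_RESOURCE) or (child.text or "").strip()
            result.append((subject, predicate, obj))
    return result


triples = journal_triples(RECORD)
for triple in triples:
    print(triple)
```

The subject links to AGROVOC are URIs rather than strings, which is what lets this journal record join up with other data sets indexed with the same vocabulary.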

Page 35: 2011 07 keynote-ktw

The Infrastructure elements (overview diagram, as on Page 18)

Page 36: 2011 07 keynote-ktw

Serendipity linking

Page 37: 2011 07 keynote-ktw

http://aims.fao.org/aos/agrovoc/c_7825

Semantic Linking

Page 38: 2011 07 keynote-ktw

http://aims.fao.org/aos/agrovoc/c_7825

http://eurovoc.europa.eu/218754

Semantic Linking

Page 39: 2011 07 keynote-ktw

http://aims.fao.org/aos/agrovoc/c_7825

http://eurovoc.europa.eu/218754

Page 40: 2011 07 keynote-ktw

http://aims.fao.org/aos/agrovoc/c_7825

http://eurovoc.europa.eu/218754

http://agclass.nal.usda.gov/nalt/2011.xml#1780

Page 41: 2011 07 keynote-ktw

Linking data through common URIs

• AGROVOC: http://aims.fao.org/aos/agrovoc/c_7825
  linked resource: http://agris.fao.org/agris-search/search/display.do?f=1996/TR/TR96001.xml;TR9600026

• Eurovoc (TOXIC SUBSTANCES): http://eurovoc.europa.eu/218754
  linked resource: http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:202:0011:0015:EN:PDF

• NALT: http://agclass.nal.usda.gov/nalt/2011.xml#1780
  linked resource: http://www.agnic.org/search/CAT85822953

• UNBIS (Toxic Substances)
  linked resource: http://unbisnet.un.org:8080/ipac20/ipac.jsp?session=128F308557F34.283092&profile=bib&uri=full=3100001~!685149~!1&ri=1&aspect=subtab124&menu=search&source=~!horizon

• Link types shown: owl:sameAs (http://aims.fao.org/aos/agrovoc/c_12332 owl:sameAs http://eurovoc.europa.eu/219871) and skos:exactMatch
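The effect of linking vocabularies can be sketched as a small graph walk: start from one concept URI, follow the mapping links in both directions, and collect every resource indexed under any of the equivalent URIs. The URIs are those shown on these slides, but the specific match pairs and the concept-to-document index below are assumptions pieced together for illustration.

```python
# Sketch only: the match pairs and the index are assumptions for illustration.
AGROVOC_URI = "http://aims.fao.org/aos/agrovoc/c_7825"
EUROVOC_URI = "http://eurovoc.europa.eu/218754"
NALT_URI = "http://agclass.nal.usda.gov/nalt/2011.xml#1780"

# Assumed mapping statements (for example skos:exactMatch) between vocabularies.
MATCHES = [(AGROVOC_URI, EUROVOC_URI), (AGROVOC_URI, NALT_URI)]

# Assumed index: concept URI -> resources tagged with that concept.
INDEX = {
    AGROVOC_URI: ["http://agris.fao.org/agris-search/search/display.do?f=1996/TR/TR96001.xml;TR9600026"],
    EUROVOC_URI: ["http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:202:0011:0015:EN:PDF"],
    NALT_URI: ["http://www.agnic.org/search/CAT85822953"],
}


def equivalent_uris(start, matches):
    """Collect every URI reachable from start through match links, in either direction."""
    seen = {start}
    frontier = [start]
    while frontier:
        current = frontier.pop()
        for a, b in matches:
            for src, dst in ((a, b), (b, a)):
                if src == current and dst not in seen:
                    seen.add(dst)
                    frontier.append(dst)
    return seen


def linked_documents(start, matches, index):
    """Return resources indexed under start or under any URI equivalent to it."""
    docs = []
    for uri in sorted(equivalent_uris(start, matches)):
        docs.extend(index.get(uri, []))
    return docs


docs = linked_documents(NALT_URI, MATCHES, INDEX)
for doc in docs:
    print(doc)
```

Starting from the NALT concept, the walk reaches the AGROVOC and Eurovoc concepts as well, so a single query surfaces resources from three otherwise separate collections.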

Page 42: 2011 07 keynote-ktw

Page 43: 2011 07 keynote-ktw

Page 44: 2011 07 keynote-ktw

The Infrastructure elements (overview diagram, as on Page 18)

Page 45: 2011 07 keynote-ktw

The CIARD RING

Routemap to information nodes and gateways

Community switchboard to find data sources

Not only a registry, but a dynamic instrument for data linking

Page 46: 2011 07 keynote-ktw

RING - Charts and numbers

http://ring.ciard.net

Page 47: 2011 07 keynote-ktw

RING – Numbers

Number of documents potentially reachable through the services registered in the RING.

Types of service considered: document repositories and bibliographic databases.

http://ring.ciard.net/totals

Page 48: 2011 07 keynote-ktw

The Infrastructure elements (overview diagram, as on Page 18)

Page 49: 2011 07 keynote-ktw

http://aims.fao.org

Page 50: 2011 07 keynote-ktw

The AIMS Community

Page 51: 2011 07 keynote-ktw

Thank You!

http://www.ciard.net
http://ring.ciard.net
http://aims.fao.org
http://agris.fao.org

Credits: Imma Subirats, Yves Jaques, Valeria Pesce, Fabrizio Celli, Ahsan Morshed, Catarina Caracciolo, Dickson Lukose, Gudrun Johannsen, Stefano Anibaldi, Armando Stellato, Tom Baker and many others