Institute of Information Systems & Information Management riese – RDFizing & Interlinking the EuroStat Dataset Effort Wolfgang Halb (JOANNEUM RESEARCH), Yves Raimond (Queen Mary University of London) and Michael Hausenblas (JOANNEUM RESEARCH) 2008-01-30
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Institute of Information Systems & Information Management
riese – RDFizing & Interlinking the EuroStat Dataset Effort
Wolfgang Halb (JOANNEUM RESEARCH), Yves Raimond (Queen Mary University of London) and Michael Hausenblas (JOANNEUM RESEARCH)
2008-01-30
2
Agenda LinkingOpenData Eurostat (http://ec.europa.eu/eurostat) Architecture Schema & Data Demo Inside
3
LinkingOpenData: Principles Items should be identified using URI references [
URIrefs] (and: don’t use bNodes); URIrefs should be dereferenceable: using HTTP
URIs allows looking up the items identified through URIrefs, cf. [http-range-14 TAG finding];
Looking up an URIref it leads to more data [follow-your-nose principle];
Links to other URIrefs should be included in order to enable the discovery of more data [How to Publish Linked Data on the Web]
4
LinkingOpenData: Current State
5
LinkingOpenData: Current State
in less than a year an emerging community (cf. [LOD ESWiki] created approx. 4 billion triples and approx. 3 million interlinks in
25 separate data sets held diverse F2F meetings, presentations, etc. upcoming: LDOW08 workshop at WWW08
6
Eurostat Eurostat (http://ec.europa.eu/eurostat) publishes statistics in these themes:
General and regional statistics Economy and finance Population and social conditions Industry, trade and services Agriculture and fisheries External trade Transport Environment and energy Science and technology
about the European Union in detail and additional statistics for major non-European countries
7
Eurostat data dump provided as download (TSV-files) updated twice a day additionally needed:
dictionary files to translate the data codes used table of contents for structure
Size of Eurostat data 5 GB data dump in approx. 4,000 files 350 million data values 80,000 different data codes