Top Banner
Data Cube Vocabulary Workshop 26 th May 2015, Luxembourg Open Data Communities Evangelos Kalampokis & Bill Roberts
21
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: OpenCube and the OpenDataCommunities

Data Cube Vocabulary Workshop26th May 2015, Luxembourg

Open Data CommunitiesEvangelos Kalampokis & Bill Roberts

Page 2: OpenCube and the OpenDataCommunities

Eurostat Workshop 2

This work is funded by the European Commission within the 7th FP in the context of the project OpenCube under grand agreement No. 611667.

Project Coordinators: Prof. Konstantinos Tarabanis, CERTH, e-

mail: [email protected] Ass. Prof. Efthimios Tambouris, CERTH, e-

mail: [email protected] Project Officer

Carola Carstens Duration:

November 2013 - October 2015 26th May 2015

The OpenCube project

Page 3: OpenCube and the OpenDataCommunities

Eurostat Workshop 326th May 2015

Partners

Page 4: OpenCube and the OpenDataCommunities

Eurostat Workshop 4

Linked Data has the potential to enable combining and performing analytics on top of disparate and previously isolated statistical data

The RDF Data Cube Vocabulary has been proposed for modelling multi-dimensional data as RDF graphs.

However, tools for handling linked data cubes:

are only few and scattered

have not been tested under real-life conditions

26th May 2015

Linked Data

Potential of using LOD in statistical data analysis unexploited

Page 5: OpenCube and the OpenDataCommunities

Eurostat Workshop 5

Facilitate publishers to create linked data cubes from legacy formats Empower users to browse, visualise, link, expand and analyse data

cubes. 26th May 2015

OpenCube benefits to stakeholders

Enable analysis not possible before (merging data cubes across the Web) Lower entry barrier to SMEs to exploit this new technology

Page 6: OpenCube and the OpenDataCommunities

Eurostat Workshop 626th May 2015

OpenCube approach & results

Page 7: OpenCube and the OpenDataCommunities

726th May 2015 Eurostat Workshop

Prototypes and Developed Components

Page 8: OpenCube and the OpenDataCommunities

826th May 2015 Eurostat Workshop

Prototypes and Developed Components

Standalone Components

TARQL data cube extension

D2RQ data cube extension

Grafter data transformation pipeline

Page 9: OpenCube and the OpenDataCommunities
Page 10: OpenCube and the OpenDataCommunities

Statistics in Open Data Communities

Open Data Communities holds around 100 statistical datasets, all in the form of RDF Data Cube.

Each dataset has machine readable metadata, using the DCAT and VoID vocabularies.

Page 11: OpenCube and the OpenDataCommunities

▪Most datasets have dimension of refArea, refPeriod and one or more other dimensions

▪Geography is defined using the UK Office for National Statistics standard codes for areas

▪Time periods are defined using the UK government 'reference.data.gov.uk' time interval

▪Other code lists are expressed as SKOS Concept Schemes and are generally defined by DCLG themselves

▪Opportunities▪The ability to share concept schemes between organisations, to agree on

standard definitions

9 December 2014 OpenCube First Review

Use of RDF Data Cube

Page 12: OpenCube and the OpenDataCommunities

▪Data analysts and researchers From local government From universities From third sector From businesses From DCLG itself

Local government and other organisations that want to re-use statistics to compile and display 'area profiles'

Developers incorporating data into visualisations

9 December 2014 OpenCube First Review

User groups

Page 13: OpenCube and the OpenDataCommunities

9 December 2014 OpenCube First Review

Data access methods

Page 14: OpenCube and the OpenDataCommunities

▪OpenCube project has led to development of new data access components, used in Open Data Communities:

Grid view of any data cube dataset

Map view of any data cube dataset

'Spreadsheet builder', combining data from multiple cubes

9 December 2014 OpenCube First Review

Components developed in OpenCube

Page 15: OpenCube and the OpenDataCommunities

9 December 2014 OpenCube First Review

Data cube grid view

Page 16: OpenCube and the OpenDataCommunities

Tools to select the two dimensions to show – and to fix the values of any dimension not shown.

9 December 2014 OpenCube First Review

Data cube grid view – selecting dimensions

Page 17: OpenCube and the OpenDataCommunities

And importantly, the user's selection of data can be downloaded in CSV, for easy use in other software packages

9 December 2014 OpenCube First Review

Data cube grid view – data download

Page 18: OpenCube and the OpenDataCommunities

9 December 2014 OpenCube First Review

Data cube map view

Page 19: OpenCube and the OpenDataCommunities

9 December 2014 OpenCube First Review

Spreadsheet builder

Page 20: OpenCube and the OpenDataCommunities

▪DCLG have selected the linked data('5 star open data') as a good way to manage and distribute their open data

▪Much of the data is statistical and the RDF Data Cube is a well-established W3C open standard

▪The combination of linked data and the standardised data cube structure allows many opportunities for automation

▪While most users don't want to consume linked data directly, it provides a platform for building many other kinds of data access

9 December 2014 OpenCube First Review

Conclusions

Page 21: OpenCube and the OpenDataCommunities

Eurostat Workshop 2126th May 2015