Metadata-powered dissemination of content

Post on 27-Jan-2015

DESCRIPTION

Presentation of cloud-based metadata aggregation infrastructures for agricultural data. Given at Alterra, University of Wageningen, The Netherlands.

Transcript

Metadata-powered dissemination of content

Nikos Manouselis

nikosm@agroknow.gr

“meaningful services around high-quality agricultural data pools”

http://wiki.agroknow.gr

• publications, theses, reports, other grey literature
• educational material and content, courseware
• primary data, such as measurements & observations
  – structured, e.g. datasets as tables
  – digitized, e.g. images, videos
• secondary data, such as processed elaborations
  – e.g. dendrograms, pie charts, models
• provenance information, incl. authors, their organizations and projects
• experimental protocols & methods
• social data, tags, ratings, etc.
• …

agricultural research(+) content

• stats

• gene banks

• gis data

• blogs

• journals

• open archives

• raw data

• technologies

• learning objects

• ………..

educators’ view

• stats

• gene banks

• gis data

• blogs

• journals

• open archives

• raw data

• technologies

• learning objects

• ………..

researchers’ view

• stats

• gene banks

• gis data

• blogs

• journals

• open archives

• raw data

• technologies

• learning objects

• ………..

practitioners’ view

• stats

• gene banks

• gis data

• blogs

• journals

• open archives

• raw data

• technologies

• learning objects

• ………..

is great…but it’s not the answer

• aim is: promoting data sharing and consumption related to any research activity aimed at improving productivity and quality of crops

ICT for computing, connectivity, storage, instrumentation

data infrastructure for agriculture

Publisher, Date, Catalog, Subject, ID, Author, Title

we actually share metadata

e.g. an educational resource

…metadata reflect the context

…sometimes, data also included
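As a minimal sketch of what such a shared record could look like, the Python snippet below models the field labels shown on the slide (ID, title, author, subject, publisher, date, catalog) for an educational resource. The `MetadataRecord` class, the optional `data_url` field, and all example values are illustrative assumptions, not a schema taken from the presentation.

```python
from dataclasses import dataclass, field, asdict
from typing import List, Optional


@dataclass
class MetadataRecord:
    """Hypothetical record; field names follow the labels on the slide."""
    id: str
    title: str
    author: str
    subject: List[str] = field(default_factory=list)
    publisher: str = ""
    date: str = ""      # ISO 8601 date string, e.g. "2013-05-01"
    catalog: str = ""   # name of the catalog that lists the resource
    data_url: Optional[str] = None  # set only when the data itself is included


# Example values are made up for illustration.
record = MetadataRecord(
    id="example:0001",
    title="Introduction to organic viticulture",
    author="A. Author",
    subject=["organic farming", "viticulture"],
    publisher="Example University",
    date="2013-05-01",
    catalog="Example educational catalog",
)

# A plain dict is convenient for serialising or exchanging the record.
record_as_dict = asdict(record)
```

The record describes the resource and its context; `data_url` stays `None` unless the data is shipped along with the metadata.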

metadata aggregations

• concerns viewing merged collections of metadata records from different sources
• useful when access is needed to specific supersets or subsets of networked collections
  – records actually stored at the aggregator
  – or queries distributed across virtually aggregated collections
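The two aggregation styles named above can be sketched in a few lines of Python. This is only an illustration under simple assumptions: sources are in-memory lists of dictionaries, and the function names `aggregate_stored` and `query_virtual` are hypothetical, not part of any tool mentioned in the talk.

```python
from typing import Callable, Dict, List

Record = Dict[str, str]


def aggregate_stored(sources: Dict[str, List[Record]]) -> List[Record]:
    """Copy every record from every source into one merged collection,
    as an aggregator that stores the records itself would do."""
    merged = []
    for source_name, records in sources.items():
        for rec in records:
            merged.append({**rec, "source": source_name})
    return merged


def query_virtual(sources: Dict[str, List[Record]],
                  predicate: Callable[[Record], bool]) -> List[Record]:
    """Send the same query to each source and merge only the matches,
    as a virtual aggregation would do without storing anything."""
    hits = []
    for source_name, records in sources.items():
        hits.extend({**r, "source": source_name} for r in records if predicate(r))
    return hits


# Two made-up source collections.
sources = {
    "repo_a": [{"id": "a1", "subject": "soil"}, {"id": "a2", "subject": "crops"}],
    "repo_b": [{"id": "b1", "subject": "crops"}],
}

superset = aggregate_stored(sources)
crops_subset = query_virtual(sources, lambda r: r["subject"] == "crops")
```

Here `superset` is the merged view over both sources, while `crops_subset` is a subset assembled at query time.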

typically look like this

Ternier et al., 2010

metadata aggregation tools

More than a harvester:

Validation Service, Repository Software, Registry Service, Harvester

Powered by

workflows with commonalities

Harvesting, Validating, Transforming

OAI target: XMLs

Indexing, Storing

Automatic metadata generation

De-duplication service

XMLs

Triplification
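The workflow steps listed above can be sketched end to end in Python. This is a simplified illustration, not the tooling from the talk: the XML sample is hand-written in the shape of an OAI-PMH `ListRecords` response carrying Dublin Core, the validation rule (an identifier and a title are required) and the de-duplication key (title plus creator, case-insensitive) are assumptions, and triples are plain tuples rather than a real RDF store.

```python
import xml.etree.ElementTree as ET

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}
DC = "http://purl.org/dc/elements/1.1/"

# Hand-written sample standing in for what an OAI target would return.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/"
         xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
         xmlns:dc="http://purl.org/dc/elements/1.1/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata><oai_dc:dc>
        <dc:title>Grape cultivars</dc:title>
        <dc:creator>A. Author</dc:creator>
      </oai_dc:dc></metadata>
    </record>
    <record>
      <header><identifier>oai:example.org:2</identifier></header>
      <metadata><oai_dc:dc>
        <dc:title>  grape   cultivars </dc:title>
        <dc:creator>A. Author</dc:creator>
      </oai_dc:dc></metadata>
    </record>
    <record>
      <header><identifier>oai:example.org:3</identifier></header>
      <metadata><oai_dc:dc>
        <dc:creator>B. Author</dc:creator>
      </oai_dc:dc></metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""


def harvest(xml_text):
    """Parse the XML and pull out identifier, title and creator per record."""
    root = ET.fromstring(xml_text)
    return [
        {
            "id": rec.findtext("oai:header/oai:identifier", default="", namespaces=NS),
            "title": rec.findtext(".//dc:title", default="", namespaces=NS),
            "creator": rec.findtext(".//dc:creator", default="", namespaces=NS),
        }
        for rec in root.findall(".//oai:record", NS)
    ]


def validate(records):
    """Keep only records with a non-empty identifier and title (assumed rule)."""
    return [r for r in records if r["id"].strip() and r["title"].strip()]


def transform(records):
    """Normalise whitespace in every field."""
    return [{k: " ".join(v.split()) for k, v in r.items()} for r in records]


def deduplicate(records):
    """Drop records whose (title, creator) pair was already seen, ignoring case."""
    seen, unique = set(), []
    for r in records:
        key = (r["title"].lower(), r["creator"].lower())
        if key not in seen:
            seen.add(key)
            unique.append(r)
    return unique


def triplify(records):
    """Turn each record into (subject, predicate, object) tuples."""
    triples = []
    for r in records:
        triples.append((r["id"], DC + "title", r["title"]))
        if r["creator"]:
            triples.append((r["id"], DC + "creator", r["creator"]))
    return triples


harvested = harvest(SAMPLE_RESPONSE)
valid = validate(harvested)
unique = deduplicate(transform(valid))
triples = triplify(unique)
```

Indexing and storing are left out of the sketch; in practice `unique` would be handed to whatever index or repository the aggregator uses.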

typical problem: computing

typical problem: hosting

to curate & preserve we need

even when machinery exists there are problems

• hardware maintenance
• technical support
• interoperability limitations
  – no APIs for the dissemination of data across systems
• hardware costs

the cloud approach

Students

Researchers

Academics

Storage and Processing Monitoring/Management/Allocation layer

Virtualization of Infrastructure Layer

Virtual Machines

Virtualization of Infrastructure Layer

Virtualized Infrastructures Management Layer

GUI tools and APIs

Cloud provider A Cloud provider B Cloud provider B

what can be hosted on the cloud

• Data storage & management tools
  – APIs for content dissemination in large networks
• Processing & visualisation tools
• Metadata aggregation infra
• Search engines and apps for institutions or communities

what data providers need

… only a browser and internet connection

CASE 1: DATA MANAGEMENT TOOL OVER THE CLOUD

Educational Pathway Authoring Tool

Cloud service workflow

comparing costs for hosting data management tool at own site and cloud

Cloud
• cloud hosting = 20 euros/month
• set up effort = 1 hr
• back up included
• Total for 5 years = 1200 euros

Hosting at institution
• 1 server + monitor + UPS = 1200 euros
• set up > 1 day effort or 100 euros
• hardware maintenance effort = difficult to define but significant
• Total for 5 years = 1300 euros + personnel for hardware maintenance + costs of unexpected HW breakdowns, e.g. supplier, hard disk

Costs of software support could be the same for both cases

After 5 years the HW should be renewed/upgraded
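The five-year totals in this comparison can be reproduced with a short calculation. The sketch below uses only the figures quoted on the slide; `maintenance_eur` is a placeholder parameter, since the slide describes hardware maintenance as hard to quantify, and the function names are illustrative.

```python
YEARS = 5
MONTHS = YEARS * 12


def cloud_total(monthly_fee_eur=20):
    """Cloud hosting: a flat monthly fee, back-up included."""
    return monthly_fee_eur * MONTHS


def institution_total(hardware_eur=1200, setup_eur=100, maintenance_eur=0):
    """On-site hosting: hardware plus set-up, plus whatever maintenance costs.

    maintenance_eur defaults to 0 only because the slide gives no figure."""
    return hardware_eur + setup_eur + maintenance_eur


cloud = cloud_total()            # 20 euros x 60 months
on_site = institution_total()    # 1200 + 100, before any maintenance
difference = on_site - cloud
```

With the slide's numbers the cloud option comes to 1200 euros and the on-site option to 1300 euros before maintenance and breakdown costs, so any non-zero maintenance effort widens the gap.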

CASE 2: SETTING UP SEARCH SERVICE/PORTAL OVER THE CLOUD

demo

• GLN backbone (http://www.greenlearningnetwork.com)
• Organic.Edunet revamp (http://www.greenlearningnetwork.com/organicedunet)
• AgShare Find OER (http://greenlearningnetwork.com/agshareoer)

how it works

Metadata aggregator for educational content

Search API

Template customization: HTML, CSS, Ajax, JS

Cloud

Educational collection management tool

Metadata aggregator for other data types

Search API

Data management tool

Institution

how it works

Metadata aggregator for educational content

Search API

Template customization: HTML, CSS, Ajax, JS

Cloud

Educational collection management tool

Metadata aggregator for other data types

Search API

Data management tool

widget in Facebook page
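To make the role of the Search API more concrete, here is a minimal Python sketch of a search function that a portal template or an embedded widget could call via Ajax. The slides do not describe the actual API, so the function name `search`, the record fields, and the JSON response shape are all assumptions made for illustration.

```python
import json


def search(records, query, limit=10):
    """Return a JSON string with the records whose title or subjects
    contain every term of the query (case-insensitive substring match)."""
    terms = query.lower().split()
    hits = []
    for rec in records:
        haystack = (rec["title"] + " " + " ".join(rec["subjects"])).lower()
        if all(term in haystack for term in terms):
            hits.append(rec)
    return json.dumps({"query": query, "total": len(hits), "results": hits[:limit]})


# Made-up aggregated metadata records.
AGGREGATED = [
    {"id": "r1", "title": "Organic soil management", "subjects": ["soil", "organic farming"]},
    {"id": "r2", "title": "Grape cultivars", "subjects": ["viticulture"]},
    {"id": "r3", "title": "Composting basics", "subjects": ["organic farming"]},
]

response = json.loads(search(AGGREGATED, "organic farming"))
```

Because the response is plain JSON, the same call can feed a customised HTML/CSS/JS template on a portal or a small widget embedded in another page.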

next challenges

1. Social Research Networking
• Connecting peers & visualising social networks, connecting researchers with publications, recommending relevant research
  – Mendeley (www.mendeley.com), ResearchGate (http://www.researchgate.net), Academia.edu (http://academia.edu), ArnetMiner (http://arnetminer.org), …
  – Social research components in popular CMSs (JomSocial, Drupal’s Buddylist, Elgg…)

connect peers/publications (+APIs)

http://dev.mendeley.com/

extending social CMS components

http://voa3r.cc.uah.es

2. Enriched research objects
• Complex, linked research objects
  – executable scientific workflows, e.g. MyExperiment (http://www.myexperiment.org), Kepler (https://kepler-project.org)
  – data sets, e.g. PLoS (http://www.plos.org), FigShare (http://figshare.com)
  – processing web services, e.g. BioCatalogue (http://www.biocatalogue.org)
  – scientist-generated classifications/taxonomies, e.g. Scratchpads (http://scratchpads.eu)
  – thematic networks/catalogues, e.g. TELeurope (http://www.teleurope.eu), VOA3R (http://voa3r.cc.uah.es)

composite/networked research

http://education.natural-europe.eu/green/exhibits/show/grape-cultivars/to-begin-with

3. End-user interfaces and access
• Facilitating and monitoring usage and access
  – Visualising social bookmarks (Klerx & Duval)
  – TinyArm (http://atinyarm.appspot.com)
  – MACE (http://portal.mace-project.eu) and maeve interactive installation at Venice Biennale (http://vimeo.com/1738770)

research visualisations & analytics

interactive navigation interfaces

METADATA AGGREGATOR

thank you!

nikosm@agroknow.gr

http://wiki.agroknow.gr

http://aginfra.eu
