Top Banner
CLARIN Technical Infrastructure PIDs - How far are we?
12
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: CLARIN Technical Infrastructure PIDs - How far are we?

CLARIN Technical Infrastructure

PIDs - How far are we?

Page 2: CLARIN Technical Infrastructure PIDs - How far are we?

Usage I

• assume that we have a recording of an extinct language and some

annotations that tell us what someone said about medicine etc• researchers create relations that need to be preserved Video Recording

Sound Recording

Annotations

Recording Session

fromReposit

oryA

fromReposit

oryB

fromReposit

oryC

How long?

Page 3: CLARIN Technical Infrastructure PIDs - How far are we?

Usage IIBiological and cultural processes have evolved together, in a symbiotic spiral; they are now indissolubly linked, with human survival unlikely without such culturally produced aids as clothing, cooked food, and tools. The twelve original essays collected in this volume take an evolutionary perspective on human culture, examining the emergence of culture in evolution and the underlying role of brain and cognition. The essay authors, all internationally prominent researchers in their fields, draw on the cognitive sciences -- including linguistics, developmental psychology, and cognition -- to develop conceptual and methodological tools for understanding the interaction of culture and genome. They go beyond the "how" -- the questions of behavioral mechanisms -- to address the "why" -- the evolutionary origin of our psychological functioning. What was the "X-factor," the magic ingredient of culture -- the element that took humans out of the general run of mammals and other highly social organisms?

Several essays identify specific behavioral and functional factors that could account for human culture, including the capacity for "mind reading" that underlies social and cultural learning and the nature of morality and inhibitions, while others emphasize multiple partially independent factors -- planning, technology, learning, and language. The X-factor, these essays suggest, is a set of cognitive adaptations for culture.

ePublicationRepository 1

eResource Repository 2

How long?

Page 4: CLARIN Technical Infrastructure PIDs - How far are we?

Usage III

eResource2 Repository 2

Ontologyopen registry

How long?

eResource1Repository 1

Page 5: CLARIN Technical Infrastructure PIDs - How far are we?

Usage IV

eResource eResource How long?

RDF Assertion

Page 6: CLARIN Technical Infrastructure PIDs - How far are we?

Currently almost 1 Mio PIDs

<?xml version="1.0" encoding="UTF-8"?><METATRANSCRIPT ArchiveHandle="hdl:1839/00-0000-0000-0005-82B0-2" Date="2006-07-18" FormatId="IMDI 3.0" Originator="Editor - Profile:SESSION.Profile.xml" Type="SESSION" Version="1"

xmlns="http://www.mpi.nl/IMDI/Schema/IMDI" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.mpi.nl/IMDI/Schema/IMDI ./IMDI_3.0.xsd">

<Session> <Name>DBD_RIF_14_12_01_064</Name> <Title>Dutch Bilingualism Database, Ethnic Dutch, Session 64</Title>

……….

<MediaFile><ResourceLink ArchiveHandle="hdl:1839/00-0000-0000-0004-DC6B-0">

http://corpus1.mpi.nl/qfs1/media-archive/dbd_data/boumans/T-Cult/Metadata/../Media/dbd_rif_14_12_01_064.wav</ResourceLink>……….

Page 7: CLARIN Technical Infrastructure PIDs - How far are we?

The Problem

• could use Cool URIs as the W3C TAG suggests to do• addresses change too often and we cannot influence that • perhaps some exceptions such as

http://www.isocat.org/datcat/DC-1708

???

???

???

you just change one entry in a database

but there is a price of course

Page 8: CLARIN Technical Infrastructure PIDs - How far are we?

Many Suggestions

• URLs: http:/www.mpi.nl/imdi/doc/white-paper all

• HTTP URIs: http://www.isocat.org/isodcr#12345 W3C

• URNs: urn:nbn:nl:ui:13-54321 EU Libs etc

• Handles: hdl:1839/00-0000-0000-0005-82B0-2 many

• ARKs: http://ark.cdlib.org/ark:/13030/ft4w10060w few

• XRIs: xri://broadview.library.example.com/ ?(urn:isbn:0-395-36341-1)

• PURLs: http://purl.oclc.org/OCLC/PURL/FAQ many• DOI: Handles + Business Model

Publisher• OpenURLs: parameterized http-get requests ?• InfoURIs integrate legacy material into Web ?

• etc

Page 9: CLARIN Technical Infrastructure PIDs - How far are we?

Evaluation

Standard Robust Software

Resolution System

Resolution Type

Security Admin

Assoc Info

Cost

URL RFC2616 no yes (DNS) single no no no

URN:ISSN ISO2397 no no ? no no no

URN:ISBN ISO2108 no no ? no no no

URN:NBN RFC3188 no no ? no no ?

PURL no no yes single no no no

Handle RFC3650 yes yes multiple yes yes little

DOI Z39.84… yes yes (Handle) multiple yes yes large

ARK no no (yes) multiple (no) yes ?

info URI RFC3668 no no ? no no no

XRI no no no ? no ? ?

simple decision: need to have something robust now without expensive business model and dependencies

Page 10: CLARIN Technical Infrastructure PIDs - How far are we?

How to do

• you need to be registered at the PID service as accepted and trusted partner(trusted partners are only those who can demonstrate that they have a proper repository system)

• you have a set of resources which have URLs • these resources have registered metadata descriptions • you request for these resources PIDs by submitting the

requested information such as URLs, MD5, minimal MD etc• you can do this either manually or via an API

• you get back the PIDs from the service• you enter these PIDs in the metadata description field

(now everyone can use it for reference purposes)

• whenever you change the URLs you need to adapt the entry (probably use a ready-made mover)

Page 11: CLARIN Technical Infrastructure PIDs - How far are we?

Associated information

• want to check authenticity before copying etc (MD5 field)• want to add citation data info extracted from metadata

records• want to solve the problem of having several centers

manipulating the Handle record without interference• want to add a pointer to access permission information • proper monitoring services

• in MPG and CLARIN some money to create robust services

Page 12: CLARIN Technical Infrastructure PIDs - How far are we?

Short Overview

• GWDG Servicehttp://handle.gwdg.de:8080/pidservice

• Java-Documentationhttp://handle.gwdg.de/javadocs/

• 11858/00-ZZZZ-0000-0000-000C-7 -> 'View Handle'• http://www.gwdg.de/aktuell/index.html -> 'Find Handle'

• service will be given also to CLARIN and probably for other research Infrastructure initiatives in Europe