DANS is an institute of KNAW and NWO
Data Archiving and Networked ServicesData Archiving and Networked Services
Enhanced Publications:an introduction
Arjan Hogenaar
11 June 2012
DANS
Data Archiving and Networked Services
• Sustainable access to digital research data
• Promoting trusted digital repositories (DSA)
But also
• Services like EASY, NARCIS and National Resolver
2
Enhanced Publications
(Traditional) publications enhanced with:
- Datasets
- Video fragments
- Audio fragments
- Images
- Detailed Information on Organisation/Author
- In order to clarify the context
- Not all these components have to be included in an EP
3
Two fundamental ways to compose EPs
A. ‘Machine-based composition’
• The objects brought together shared already one
or more properties
• Examples:– ARVODI (in the Netherlands): project description, publication
and dataset share a unique identifier
- OpenAIREPlus: the EU Grant agreement number
• Relationships between the objects are ‘unbiased’
4
Two fundamental ways to compose EPs
B. ‘Man-made composition’
• The objects brought together based on the opinion
of the composer
• Not always clear why a typical object (an image)
has been related to – for instance – a publication
5
Is there a relationships between these two teams?
Denmark Netherlands
6
OAI-ORE (Object Re-use and Exchange) for EPs (some basics)
• Resource Map to describe an EP (in OAI-ORE: an aggegration)
• Aggregation to describe the components of an EP (in OAI-ORE: the aggregated resources)
• Aggregated resources may be documents (Eprints) , datasets (data objects), images, metadata records, authors (persons) and so on
7
Data Model of EPs
Van de Sompel: OAI-ORE Primer
8
Advantages EPs
• Background information easy to find
• Conclusions in a paper may be verified via the dataset(s) used
• Information is being presented in context
9
Additional Advantages of man-
made EPs
• Authors may not simply relate components, but add comment why they have chosen to relate them
• An author may allow other researchers to add components to the original EP: an EP is no longer a static document
• But be aware: in the case of man-made EPs it is not always clear why objects have been put together into one single EP!
10
Example of an EP: textual representation
Not very revolutionary! But…. See the xml!
11
Rdf/xml serialisation of the OAI-ORE of an EP:
<rdf:RDF>
<rdf:Description rdf:about="http://escape.utwente.nl/rem/561">
<rdf:type rdf:resource="http://www.openarchives.org/ore/terms/ResourceMap"/> <dcterms:created>2011-02-16T14:43:57.756Z</dcterms:created>
<dcterms:modified>2011-05-05T08:09:09.571Z</dcterms:modified>
<ore:describes rdf:resource="http://escape.utwente.nl/aggregation/561"/> <dcterms:creator rdf:nodeID="183fb582"/>
</rdf:Description>
<rdf:Description rdf:nodeID="183fb582">
<foaf:name>ESCAPE repository</foaf:name>
<foaf:page rdf:resource="http://escape.utwente.nl/"/>
</rdf:Description>
<rdf:Description rdf:about="http://escape.utwente.nl/aggregation/561">
etc
12
EPs and DANS
DANS has three major services:
– The portal NARCIS
– The Archiving System for Datasets: EASY
– The National Resolver
13
EASY and Resolver
• EASY is used both as a system to deposit datasets and as an access point to search for datasets: https://easy.dans.knaw.nl/ui/home
• The Resolver is a new service, resolving the persistent identifier of an object to its actual URL. The Resolver is crucial in the realisation of sustainable access to information objects:
http://persistent-identifier.nl/
14
NARCIS
• Main national portal to research information, publications and datasets (from EASY and 3TU.Datacentrum)
• Information gathered using OAI-PMH protocol
• Is a starting point for the developments of EPs, as most of the EP-components (project descriptions, descriptions of researchers and institutions, metadata of publications and datasets) have been incorporated in the portal:
www.narcis.nl15
EPs in NARCIS
• In co-operation with SURF in 2011:
– Study into the possibilities to present EPs in NARCIS
– under the following conditions:
• OAI-PMH to harvest descriptions of EPs
• OAI-ORE to describe the relationships between objects within EPs or between different EPs
• RDF/xml for the serialisation of EPs
• Incorporation of a visualisation tool to present EPs
• Use of persistent identifiers for authors and objects
16
Only use this slide to present a screenshot of an application. As no style is applied, the screenshot can take up the whole slide. For all other information please use the slide with preset style!
Visualisation of an EP in NARCIS: from RDF/xml to user-friendlyness
17
Digital Author Identifier:its role in EPs
* No doubt on the identity of an author
* Centralised system in the Netherlands
* May be used in publications, datasets,
project descriptions and of course in EPs
R.L. Zijdeman has the DAI:
304832960
18
Persistent Identifiers:separating a resource form its actual URL
• URLs of object is not stable
• But: assigning a PID to an object may overcome this problem
• The combination of the object, its PID and an resolver service (to translate the PID in the actual URL of the object) facilitates sustainable access
A thesis and its PID as part of an
EP in NARCIS
19
EP: summary
1. Total new way in information dissemination
2. EP’s may be deposited in repository just like traditional publications
3. OAI-PMH may be used for the harvesting of description of metadata of EP’s
4. Description of EPs relies on OAI-ORE model (with resource maps and aggregations)
20
Two key aspects in EP development
• Digital Preservation
• Data Curation
21
Digital Preservation
Components of an EP have to be sustainable available
– For publications:
electronic depots of national libraries
assigning Persistent Identifiers to the objects
secure information infrastructure
- For datasets: Data Curation
- Data management plan
- Data Seal of Approval (DSA)
22
Digital Curation of Datasets
A. Data Management Plan (at the start of a project) :
• Describing nature of data
• Describing way of data collection
• Hard- and software to be used
• Describing intellectual property rights and legal requirements
• Describing (re-)use of data
23
Digital curation of Data
B. Data Seal Of Approval (DSA)http://datasealofapproval.org/
• Ensuring sustainable access to research data
• Guidelines for:– Data producer
– Data archive
– Data consumer
• Self-assessment, peer reviewed by DSA board, whereafter DSA wil be granted
• Trust is crucial 24
Open Access (as seen from a funder)
NWO (Netherlands Organisation for Scientific Research) promotes Open Access in two ways:
1. Open Access to research publications
2. Open Access to research data
25
NWO OA policy: publications
• Publications as results of research projects supported by NWO should, irrespective of other publication possibilities, be made accessible to the general public as quickly as possible via Open Access.
• Copyrights may only be assigned to a third party insofar as this does not block the option to publish via Open Access
• Deviation from this rule is only possible with the explicit prior approval of NWO.
26
NWO OA policy: research data
Aim: Maximise access to data for re-use (‘Open
Access unless specified otherwise’)
Means: Setting up an Open Access Code of Conduct
to make rights and obligations visible to all
stakeholders
Legally: NWO is considered to be the co-producer of
data generated as a result of a NWO-grant
Practice:Data collected during a research project have
to be stored and documented in a thorough
and accessible manner
27
Other challenges
1. Authors may combine components from different sources
2. Therefore, sustainable access not guaranteed for all components
3. Maintenance of dynamic composed publications is technically no problem, but the versioning is.
4. Copyright rules may vary for the different components
28
Data Archiving and Networked Services
Anna van Saksenlaan 10, 2593 HT The Hague. P.O. Box 93067, 2509 AB The Hague.
T +31 (0)70 3446 484, F +31 (0)70 3446 482, E [email protected]
Thank you for your attention
For more information please contact
29