D3.3: Report on data sharing policies Author: Kate Fernie, MDR Ariadne is funded by the European Commission’s 7th Framework Programme.
Jun 29, 2015
D3.3: Report on data sharing policies
Author:
Kate Fernie, MDR
Ariadne is funded by the European Commission’s 7th Framework Programme.
ARIADNE D3.3 (Public)
2
Author: Kate Fernie, MDR
Contributors: Guntram Geser, SRFG
Elizabeth Fentress, AIAC
Costis Dallas, Athena-‐RC
Franco Niccolucci, PIN
Cesar Gonzalez-‐Perez, CSIC
Roberto Scopigno, Paolo Cignioni, ISTI-‐CNR
Ulf Jakobsson, SND
Emmanuelle Bryas, Amala Marx, Kai Salas-‐Rossenbach, Bernard Pinglier, INRAP
Hella Hollander, KNAW-‐DANS
ADS, Discovery, ZRC SAZU, CYI-‐STARC, ARHEO, MNM-‐NOK, OEAW, ARUP-‐CAS, NIAM BAS, MiBAC, DAI
Version 1.0 (final) 27th January 2014
ARIADNE is a project funded by the European Commission under the Community’s Seventh Framework Programme, contract no. FP7-INFRASTRUCTURES-2012-1-313193. The views and opinions expressed in this presentation are the sole responsibility of the authors and do not necessarily reflect the views of the European Commission.
Partner in charge of the deliverable:
MDR Partners (Consulting) Ltd
Quality review:
Julian Richards and Holly Wright, UoY
-‐ADS
ARIADNE D3.3 (Public)
3
Table of Contents
1 Executive summary ........................................................................................................... 5
2 Introduction ...................................................................................................................... 7
3 Methodology .................................................................................................................... 8
4 Sharing knowledge: Open Data ....................................................................................... 10 4.1 Open Access Publications ............................................................................................................. 10 4.2 Open licences ............................................................................................................................... 11 4.3 Linked Open Data ......................................................................................................................... 14 4.4 Attribution of research data ......................................................................................................... 16
5 Situational analysis ......................................................................................................... 17 5.1 How do we define research data in archaeology? ....................................................................... 17 5.2 How and when does openness need to be limited? .................................................................... 18
5.2.1 Active research projects ........................................................................................................................ 18 5.2.2 Past research projects ........................................................................................................................... 19 5.2.3 Database rights ...................................................................................................................................... 20 5.2.4 Archaeological site location data ........................................................................................................... 20 5.2.5 Commercial value .................................................................................................................................. 20 5.2.6 Privacy and data protection ................................................................................................................... 21 5.2.1 National legislation ................................................................................................................................ 21
5.3 How should the issue of data re-‐use be addressed? .................................................................... 23 5.3.1 Licensing ................................................................................................................................................ 23 5.3.2 Data citation .......................................................................................................................................... 24 5.3.3 Should ARIADNE adopt Creative Commons licences for resource discovery metadata? ...................... 25
5.4 How should we enhance data awareness and the culture of sharing? ........................................ 26
6 Survey of ARIADNE datasets ........................................................................................... 28 6.1 Rights holders ............................................................................................................................... 28 6.2 Content copyright ......................................................................................................................... 29 6.3 Content Access ............................................................................................................................. 29 6.4 Metadata rights ............................................................................................................................ 31 6.5 Specific conditions affecting Access ............................................................................................. 32
7 Discussion ....................................................................................................................... 33 7.1 Deposit agreements with content providers ................................................................................ 33
ARIADNE D3.3 (Public)
4
7.2 Agreements with ARIADNE .......................................................................................................... 33 7.3 Data sharing and access ............................................................................................................... 34 7.4 Licence framework ....................................................................................................................... 35
7.4.1 Resource description/Collection description metadata ........................................................................ 35 7.4.2 Content licensing ................................................................................................................................... 35
8 Recommendations .......................................................................................................... 37
9 References ...................................................................................................................... 38
Glossary ................................................................................................................................ 41
Appendix 1: Ariadne questionnaire on datasets, metadata and data sharing policies ........... 42 Rights holder(s) -‐ The owner(s) of the rights of the content being provided ........................................ 42 Content copyright .................................................................................................................................. 44 Content Access rights ............................................................................................................................ 47 Use of standard licences ........................................................................................................................ 50 Metadata rights ..................................................................................................................................... 52
Appendix 2: DANS Licence Agreement and help text ............................................................. 54
Appendix 3: The Terms of Use and Access to ADS Resources ....................................... 58
Appendix 4: Accessibility levels at SND ........................................................................... 62
Appendix 5: Data.Gouv.FR – Open Licence ...................................................................... 64
ARIADNE D3.3 (Public)
5
1 Executive summary
The ARIADNE network developed out of the need to develop infrastructures for the management and integration of archaeological data at a European level. The network brings together existing archaeological research datasets with the aim of making them more accessible to researchers, and to build a better understanding of how this data might be brought together to create new insight and understanding within archaeology. To achieve this ARIADNE needs to consider the data access and sharing policies relevant to archaeological research datasets.
This report provides an introduction to ARIADNE, and the methodology used to collect information and inform its findings.
Following on from the introduction in Section 2, and description of the methodology used in Section 3, the context of the move towards open access for research publications and datasets is considered in Section 4, within which the 2003 Berlin Declaration on Open Access to Knowledge in the Sciences and Humanities was an important milestone. Open Access publications have changed the subscription model providing researchers, students, teachers and members of the public with free access to the latest research. The development of licences, such as those prepared by Creative Commons and the Open Data Commons, is helping data creators share their results in a way that makes conditions for use and re-‐use clear to the public. Technical developments are both facilitating data sharing and enabling data citation, which is important in allowing academic recognition for these new forms of publication.
Section 5 is a situational analysis based on a consultation of ARIADNE partners to understand thinking amongst the archaeological research community on data sharing. Research data in archaeology is defined, and the circumstances in which access to archaeological data needs to be limited are explored. These include, amongst others, the sensitivity of some sites to treasure hunters, national legislation, commercial value, active research projects and complications over the management of rights in legacy datasets. The licensing of data for re-‐use, open licensing of resource discovery metadata and ways of enhancing data awareness and the culture of data sharing are discussed.
Section 6 considers the results of a survey of the sharing policies in place for the datasets ARIADNE partners plan to provide for integration within the research infrastructure. The survey revealed that almost 90% of the datasets are accessible online, with 50% openly available to public users without registration. Over half the datasets are made available under Creative Commons, or other forms of open licences.
ARIADNE D3.3 (Public)
6
Section 7 discusses the issues identified in the report. Consultation with partners revealed that access and sharing policies are still evolving. The aim of this report is to help establish best practices in the management of rights and data access amongst partners and the wider community. This means considering the whole data sharing chain, from the archaeological researcher depositing their data with an archive, to its integration in the ARIADNE infrastructure, and its subsequent availability to the research community. The licence framework discussion covers both data deposits and access policies, and both content and resource description (discovery) metadata. Our survey confirmed that Creative Commons licences are being widely adopted, although there are differences in the conditions specified. The potential impact of the main licence conditions (Attribution, Share Alike, No Derivatives and Non-‐Commercial) on data sharing in ARIADNE are considered.
Finally, Section 8 recommends that ARIADNE include in its data sharing policy framework:
• A common method of data citation for adoption by partners, as the means of ensuring academic recognition is important in motivating researchers to share their datasets;
• Allocation of DOIs (or the equivalent) for datasets ingested to the ARIADNE infrastructure; persistent identification underpins data sharing and data citation;
• The use of the Creative Commons licence suite (version 4.0 is preferred) for content provided to ARIADNE; CC BY is recommended for open access;
• A collection description be provided with each collection provided to ARIADNE and licensed under a CC BY licence;
• Item level metadata records be published under a CC0 (public domain) licence to enable integration of multiple datasets, to support resource discovery and enable linked open data.
ARIADNE D3.3 (Public)
7
2 Introduction
The amount of data being produced by archaeological research projects has increased exponentially over the last ten years. Archaeologists are pushing the boundaries of available computing resources in the course of their work, generating significant amounts of primary research data.
The ARIADNE network developed out of a need to develop infrastructures for the management and integration of archaeological data at a European level. With funding from the European Commission’s Seventh Framework Infrastructures programme, ARIADNE brings together existing archaeological research datasets and infrastructures with the aim of making them more accessible to researchers, and to build a better understanding of how this data might be brought together to create new insight and understanding within archaeology. There is now a large availability of archaeological digital datasets that, together, span different periods, domains and regions, and more are continuously created as a result of the increasing use of IT. These are the accumulated outcome of the research of individuals, teams and institutions.
Traditional approaches to research protect the intellectual property rights of individual researchers. Sometimes this protection extends beyond a reasonable term, for example in the case of excavations unpublished for decades, and primary data still under study by the archaeologist. By contrast, sharing data was perceived as interesting and useful by the majority of respondents to a survey completed by the ADS in 2007 [1]: which included comments like ‘having such data available will assist any longer-‐term monitoring projects or even cast new light on a previously recorded subject’.
ARIADNE aims to bring together and integrate the existing archaeological research data infrastructures, so that researchers can use the various distributed datasets. It is developing tools and services to provide access and common interfaces to data repositories, and will support the integration of datasets to enable access by the community of archaeological researchers.
To achieve this, ARIADNE needs to consider the data access and sharing policies relevant to archaeological research datasets. This report begins by considering the broad context of the move towards open access for research publications and data, next the situation in relation to providing access to archaeological datasets is explored, and last, but by no means least, the sharing policies in place for the datasets that ARIADNE partners plan to provide for integration to the research infrastructure are examined.
ARIADNE D3.3 (Public)
8
The principle purpose of this report is to define policies for data access via the ARIADNE infrastructure that take into account the requirements defined by the owners of IPR on the content and reflect EU strategic policies of Open Access to Research Data [2] [3].
“The best research infrastructures support researcher collaboration in virtual research communities where knowledge sharing between the best brains is combined with open access to research results and state-‐of-‐the-‐art computing systems to support the efficiency and creativity of research in Europe.” Neelie Kroes, Vice President of the European Commission talking about research infrastructures and the potential of e-‐Science in today’s society in an interview1. Kroes went on to say “Open research data could help combine and share the works of different research groups, thereby creating new collaborations and tackling new issues for solving common challenges.”
3 Methodology
A key element to the approach in gathering data for this report has been to understand the requirements of the ARIADNE partners.
A questionnaire was sent to all ARIADNE content providing partners to collect information about the datasets they plan to provide for integration with the research infrastructure. The results of the survey were used to inform both this deliverable on data sharing policies, and also Deliverable 3.2, which describes the metadata standards and thesauri in use by the consortium [4]. The survey revealed the heterogeneous nature of the datasets being made available for integration, as several partners hold data collections that include deposits by many different archaeologists working within their countries, and beyond. It also provided useful information about the strategies in place for managing copyright and licensing access to both content and metadata amongst these collections.
Following the initial analysis of the results of the survey of datasets, a second survey was carried out to gather partners’ opinions on questions relating to providing open access to research datasets. This survey was open to all partners, including those who do not currently plan to provide datasets to the infrastructure. It invited partners to define what they mean by research data and to discuss when openness needs to be limited and why, how the issue of data re-‐use should be addressed and how to enhance the culture of data sharing. The results provide valuable information about the context of archaeological research.
1 E-‐Data & Research, Newsletter on data and research in the Social Sciences and Humanities, Special Issue 2014
ARIADNE D3.3 (Public)
9
The third strand to the methodology involved desk research to gather information about the context of data sharing, and developments in policy and practice.
ARIADNE D3.3 (Public)
10
4 Sharing knowledge: Open Data
Technology is changing the way research is carried out, and the way that its results are published. It is creating new possibilities for sharing research data, and this brings with it a requirement for new thinking on data access policies. In this section we consider what open data means within the context of ARIADNE.
The 2003 Berlin Declaration on Open Access to Knowledge in the Sciences and Humanities is one of the milestones of the open access movement, and sets out steps to support the transition to open access publication on the Internet for the producers of scientific knowledge.
OpenAIRE (the Open Access Infrastructure for Research in Europe) defines Open Access as “the immediate, online, free availability of research outputs without restrictions on use commonly imposed by publisher copyright agreements. Open Access includes the outputs that scholars normally give away for free for publication; it includes peer-‐reviewed journal articles, conference papers and datasets of various kinds” [6]. OpenAIRE suggests that the benefits include:
• improvements in access as the basis for teaching, research and valorization for civil society;
• increased visibility and higher citation rates for researchers; • free access to content worldwide.
The Open Definition [7) sums up the meaning of open data as “a piece of data or content is open if anyone is free to use, reuse, and redistribute it — subject only, at most, to the requirement to attribute and/or share-‐alike". Openness in this, and other definitions, means data is made available under licence conditions that permit re-‐use for free (or at no more than reasonable reproduction costs) and preferably via the Internet.
4.1 Open Access Publications
Open Access Publications break the traditional subscription model of academic publishing. In the print publication world, the publisher owned the rights to articles in their journals and charged readers for access. In the Open Access world of digital publication, by shifting publishing costs to the author/funding bodies and by using open licences, readers are able to obtain content at no cost. The benefits of this approach include:
• Researchers can read the findings of others without restriction • Opening up public access to the results of publicly funded research
ARIADNE D3.3 (Public)
11
• Students and teachers have access to the latest research findings from across the world
It is worth noting that the Open Access publishing model covers a range of components including reader rights, re-‐use rights, copyright, author posting rights, automatic posting and machine readability. Publishers and funding bodies have differing policies on these components that affect the degree of openness of individual articles or whole journals [8].
4.2 Open licences
Open licences are those which permit re-‐use of data for free, and in principle this definition could include any royalty-‐free copyright licence. However such licences might not conform to all of the principles set out in the Open Definition, which identifies a series of conformant licences2 set out in the table below.
Licence Domain BY SA Comments
Creative Commons CCZero (CC0)
Content, Data
N N Public Domain Dedication -‐ all rights are waived including attribution. Fully open, anybody can do anything with the data.
Open Data Commons Public Domain Dedication and Licence (ODC PDDL)
Data N N Places the data in the Public Domain – all rights are waived
Creative Commons Attribution
(CC-‐BY 1.0, 2.0, 2.5, 3.0, 4.0)
Content Y N
All versions of CC-‐BY allow redistribution and reuse of a work on condition that the creator is appropriately credited (attribution).
CC-‐BY credits the original data producer, which is an important motivation for sharing the data.
Open Data Commons Attribution License
Data Y N The data(base) is made available on condition that the creator is credited (attribution for data(bases)).
2 http://opendefinition.org/licenses/
ARIADNE D3.3 (Public)
12
(ODC-‐BY)
Creative Commons Attribution Share-‐Alike (CC-‐BY-‐SA 1.0, 2.0, 2.5, 3.0, 4.0)
Content Y Y
All versions of CC-‐BY-‐SA allow re-‐distribution and re-‐use of a licensed work on condition that the creator is appropriately credited, and that any derivative work is made available under “the same, similar or a compatible license”. Version 1.0 is little used and not recommended by the Open Definition because it is incompatible with future versions
Open Data Commons Open Database License
(ODbL)
Data Y Y
The data(base) is made available on condition that the creator is credited, and any derivatives are made available under “the same, similar or a compatible license” (attribution and ShareAlike for data(bases)). The condition “share-‐alike” limits re-‐use and thus the content is less open and should be avoided for Linked Data.
Free Art License
(FAL) Content Y Y
The Free Art License grants the right to freely copy, distribute, and transform creative works without infringing on the author's rights. Follows the principles of copyleft: freedom to use, copy, distribute, transform, and prohibition of exclusive appropriation.
UK Open Government Licence 2.0
(OGL-‐UK-‐2.0)
Content, Data
Y N
For use by UK government licensors this licence grants a worldwide, royalty free licence to re-‐use and redistribute a work on condition the source is appropriately credited.
Re-‐uses of OGL-‐UK-‐2.0 material may be released under CC-‐BY or ODC-‐BY. Version 1.0 is not conformant with the Open Definition.
Table 1: Open licences which conform to the Open Definition
ARIADNE D3.3 (Public)
13
The following licences conform to the Open Definition but are little used or deprecated [4]:
Licence Domain By SA Comments
GNU Free Documentation License (GNU FDL)
Content Y Y
A copyleft licence – derivative works must be made available under the same or a similar licence. It is principally intended “for works whose purpose is instruction or reference” and its most prominent user is Wikipedia. GNU FDL is only conformant subject to the Open Definition with certain provisos.
MirOS Licence
Code, Content
Y N Little used
Talis Community Licence
Data Y
This licence is only available in draft form and has been deprecated in favour of the Open Data Commons licences: PDDL, ODC-‐BY and ODC-‐ODbL
Against DRM Content Y Y Against DRM 2.0 is a free copyleft licence for artworks – but is little used.
Design Science License
Data Y Y Little used. Includes an interesting definition of source data3.
Table 2: Open licences that are less used
3 Design Science License definition: “Source Data” shall mean the origin of the Object Form, being the entire, machine-‐readable, preferred form of the Work for copying and for human modification (usually the language, encoding or format in which composed or recorded by the Author); plus any accompanying files, scripts or other data necessary for installation, configuration or compilation of the Work. (Examples of Source Data‚ include, but are not limited to, the following: if the Work is an image file composed and edited in PNG format, then the original PNG source file is the Source Data; if the Work is an MPEG 1.0 layer 3 digital audio recording made from a WAV format audio file recording of an analog source, then the original WAV file is the Source Data; if the Work was composed as an unformatted plaintext file, then that file is the Source Data; if the Work was composed in LaTeX, the LaTeX file(s) and any image files and/or custom macros necessary for compilation constitute the Source Data.) -‐ See more at: http://opendefinition.org/licenses/dsl/#sthash.QDQg7ZBo.dpuf
ARIADNE D3.3 (Public)
14
4.3 Linked Open Data
Tim Berners-‐Lee defines Linked Open Data as Linked Data that is released under an open licence [9]. Berners-‐Lee defined four expectations of Linked Data (to use URIs as names for things, to use HTTP URIs so that people can look up those names, provide useful information when someone looks up a URI, and include links to other URIs so that people can discover more things) and then proposed a star scheme to rate the openness of Linked Open Data. 5StarData.info provides examples for each step on the star scheme and discusses the costs and benefits [10].
Star Principle Comments
★ Make your stuff available on the Web (whatever format) under an open licence.
The content is accessible on the Web under an open licence published in a document such as a PDF. Other than by writing a custom scraper, it's hard to get the data out of the document.
★★ Make it available as structured data (e.g., Excel instead of image scan of a table)
The data is accessible on the Web in a structured way published in a document such as an Excel spreadsheet. To get the data out of the document you depend on proprietary software.
★★★ As above plus use non-‐proprietary formats (e.g., CSV instead of Excel)
The data is accessible on the Web in a structured way and is published in formats that mean everyone can use the data easily. On the other hand, it's still data on the Web and not data in the Web4.
★★★★ All the above, use open standards from W3C (RDF and SPARQL) and URIs to denote things, so that people can point at your stuff
Now data is in the Web. The data items have a URI that means they can be shared on the Web. A native way to represent the data is using RDF, however other formats such as Atom can be converted/mapped, if required.
★★★
★★
All the above plus link your data to other data to provide context
Now the data is published in the Web and is linked to other data, which means that both the consumer and the publisher can benefit from the network effect.
Table 3: 5 Star classification scheme for Linked Open Data
4 http://webofdata.wordpress.com/2010/03/01/data-‐and-‐the-‐web-‐choices/
ARIADNE D3.3 (Public)
15
At the International Linked Open Data in Libraries Archives and Museums summit, the various open licences were considered in the context of publishing content metadata as Linked Open Data. The summit came up with a four-‐star classification scheme that arranges the open licences in order of their openness and usefulness in this context [11].
Star Licences Comment
★ Attribution Share-‐Alike Licence (CC-‐BY-‐SA/ODC-‐ODbL)
The data is open but the Share-‐Alike licence limits the potential to combine datasets – as each must conform to the exactly the same Share-‐Alike licence. In Europe-‐wide research networks the Share-‐Alike licence reduces the re-‐use potential of a dataset (as there are several versions of the CC and ODC share-‐alike licences).
★★ Attribution Licence (CC-‐BY / ODC-‐BY) with a form of attribution not including linkbacks
The metadata is open and can be used provided the source is attributed. The data provider specifies the means of attribution, e.g. by specifying use of a ‘creator/source’ element in the metadata or a citation method (e.g. a scholarly citation). The disadvantage of this method for LOD is that users must discover the required mechanism for attribution and how to comply with it. Where different methods are applied for different datasets large-‐scale open data integration (e.g. mash-‐ups) become very difficult.
★★★ Attribution Licence (CC-‐BY / ODC-‐BY) when the licensor includes linkbacks to meet the attribution requirement.
The metadata is open and can be used provided the source is attributed. The user of the data fulfills the condition for attribution by including a web-‐link back to the source (see for example the method proposed for 5* Linked Open Data in table 3 above).
★★★★ Public Domain (CC0 / ODC PDDL / Public Domain Mark)
Metadata is fully open. It requires the least action by users to re-‐use the data, to link it or integrate the data with other data. It supports the creation of new services and encourages innovation. It maximizes public investment.
ARIADNE D3.3 (Public)
16
4.4 Attribution of research data
Satisfying the requirement of a CC-‐BY or ODC-‐BY licence for attribution of research datasets requires a system for data citation. This can help:
• the reuse and verification of data • the impact of data to be tracked • to recognise and reward data producers
The Archaeology Data Service (ADS), in line with recommendations from the Digital Curation Centre, has proposed that such a system must be able to uniquely identify the dataset, provide the reader with information needed to access the dataset, a means of access online, and be usable by both humans and software tools [12] [13]. The elements recommended by DCC and ADS for a data citation include:
• Author, Publication Year, Title, Edition, Version, Feature name and URI, Resource Type, Publisher, Unique numeric footprint (UNF), Identifier and location
DataCite is a not-‐for-‐profit organisation formed with the aim of promoting the citation of research data to increase its acceptance as a legitimate contribution to the scholarly record and supporting data archiving [14]. DataCite has proposed a similar (but simplified) set of elements for a data citation:
• Creator, Publication Year, Title, Version, Resource Type, Publisher, Identifier
There are various systems for establishing persistent identifiers (e.g Handles, Archival Resource Keys (ARKs) and Persistent URLs (PURLs)) that can all be resolved to an Internet location. The Digital Object Identifier (DOI) scheme is recommended by both ADS and DataCite for use with research datasets [15].
ARIADNE D3.3 (Public)
17
5 Situational analysis
The European Commission held a public consultation on open research data in July 2013. Five questions were posed to stakeholders to structure the debate, and the results of the consultation were subsequently published online [3]. Five similar questions were posed to partners in the ARIADNE consortium to gather their opinions and to understand the thinking of the archaeological research community on data sharing.
5.1 How do we define research data in archaeology?
Research data can be defined as any data captured by research activities or used for research. Data of interest for archaeological research5 includes data sets produced by archaeological researchers, research institutions, heritage agencies and as a result of contract archaeology. A survey of partners noted that archaeological researchers also use data captured for other purposes including airborne and satellite remote sensing data (captured for commercial mapmaking and other reasons), and digital 3D models produced for museum exhibitions or tourism.
There are various aspects to take into account in the definition of research data, including the conditions of data acquisition, how the data are used, and the questions posed the data has to answer. In the context of archaeology, data may relate to remains of human activity that have been destroyed since the data were captured. Research data must be identified and described to capture these aspects.
As a research infrastructure, ARIADNE’s focus is on the datasets deposited in repositories. This
5 “Research data in archaeology are the outcome of particular procedures of definition, data constitution, observation, capture and representation, as well as perceptual and cognitive processes of recognition, identification and categorization. They include all information objects that capture aspects of the domain of archaeology (the material traces of human activity) and that are, or may be, used to construct archaeological knowledge. They include analogue representations of archaeological sites, artefacts, ecofacts and traces of past human activity (such as photographs, drawings, descriptions and documentation) as well as data records.
There are ‘objective’ data such as measurements, geo-‐location and identification of material, and more ‘subjective’ data such as identifications of type, cultural provenance, dating and attribution. Archaeological data such as the above are produced as part of active research projects, i.e. there are active research groups that are working towards studying and publishing the results of their research, in which description of data is an important activity. However there are cases where data are the outcome of archaeological projects that happened many decades ago and still remain unpublished”.
ARIADNE D3.3 (Public)
18
may include:
• Data produced by research activity in interim and final formats, which are being made available for reuse; this includes:
o data produced as the result of particular procedures (e.g. measurement data, etc.)
o data produced as a result of perceptual and cognitive processes (e.g. identification of types or categorization, etc.)
o information objects that capture traces of human activity (e.g. photographs, drawings, etc.)
o raw and processed data • Metadata provided by researchers to describe their datasets. In practice, this metadata
tends to include “content” (information about the cultural object represented by a digital resource e.g. the date, the style/period, the historical geography, find spot) and “context” (information about the research questions and general conditions of data acquisition). Metadata may be incorporated into data files (e.g. a ground penetrating radar scan data includes data capture parameters). For these reasons most partners include metadata in their definition of research data;
• metadata provided to describe collections and their content; • preliminary datasets produced as a result of research activity (e.g. drafts) deposited for
archiving sometimes under restrictive conditions that prohibit re-‐use; • working archives of individual archaeologists (e.g. field diaries or personal notes)
deposited for archiving, sometimes under restrictive conditions that prohibit re-‐use. • project management data (e.g. email archives, management documents) deposited for
archiving, sometimes under restrictive conditions that prohibit re-‐use.
ARIADNE is a Europe-‐wide initiative, and it is important to bear in mind that the definition of archaeological data, and what constitutes research activity, differs between countries.
5.2 How and when does openness need to be limited?
5.2.1 Active research projects
During current research projects, whilst teams of researchers are actively engaged in collecting, recording and analysing information, openness needs to be limited to allow time for publication before the data is made available to everyone else.
ARIADNE D3.3 (Public)
19
In general, the right and obligation of publication of archaeological excavations lies with the excavator. Some countries (such as Greece) have acted to address delays in publication by setting a maximum number of years by which the work should be published by the excavator -‐ after this time the works can go to the public domain for study and publication.
In many countries across Europe, museums and custodial institutions have sui generis rights of reproduction and publication over cultural heritage objects in their collections. In some cases this right is limited to a period of years after the object comes to light for the first time.
Although full access to ‘active research’ data is generally restricted, a level of information access may be provided to allow other researchers to know who is working on a particular site, excavation, assemblage or archaeological research problem. Collection level descriptions may be available to provide information about the kind of content included in a data archive, while access to the full content is restricted.
When depositing data in a Data Archive (such as the Swedish National Data Service, KNAW-‐DANS in the Netherlands or the Archaeology Data Service in the UK) researchers decide on what access level the data shall have. This can include restrictions on access, whilst projects are still active. Researchers may limit access for a period of time and then make the data available for use by students and researchers from academic institutions, etc. In some cases, researchers may restrict access to certain data such as personal information, or request it be removed or merged, and in that way remove any restrictions over the rest of their dataset (see 4.2.7 below). A contract between the depositor (researcher/research team) and the archive/repository regulates the openness for data deposits. In most cases, data is deposited with its provenance (i.e. the field project, excavator or research team are identified) and licensed for use with the proviso that the researchers who produced the dataset are attributed.
5.2.2 Past research projects
In principle it should be possible to make data created by older archaeological research projects available for research, education and enjoyment.
In practice, access may be limited owing to the fact that in the past many creators reserved their rights by using “all rights reserved” as the default copyright statement. In more recent times, creators/providers of content have begun to take steps to express which uses of the content are permitted by using copyright licences, such as the ones developed by Creative Commons and the Open Knowledge Foundation [16] [17] (see also Appendix 1). However, the use of “all rights reserved” as the expression of copyright means the creators of many datasets from past research projects need to be contacted to obtain permission to use the data. This limits access.
ARIADNE D3.3 (Public)
20
5.2.3 Database rights
There is a specific European Union law on database rights (Directive No. 96/9/EC, 11 March 1996), which is implemented in the national law of Member States. This law was introduced to recognize the substantial investment made in compiling databases, and to prevent unauthorized copying or re-‐use of their content. Database rights are established automatically and cover both substantial extraction and copying of the database, and also piecemeal copying of data and subsequent reassembly. In principle, non-‐substantial or “fair use” is possible, but any substantial copying of relevant data requires obtaining permission and agreeing terms of use with the database owner. In addition to the database rights, the arrangement, selection and presentation of the data may also be protected by copyright [18] [19] [20].
5.2.4 Archaeological site location data
Certain types of archaeological sites (such as shipwrecks and places where there have been finds of gold, silver and other valuable objects) are vulnerable to treasure hunters. Cemeteries and sites that contain human remains are sensitive for various reasons, for example there may be living relatives of people buried in long dis-‐used churchyards that are the subject of a modern excavation. Archaeological sites and finds on military installations may also be sensitive.
Legislation varies between EU member states with some countries limiting access to information about the locations of such sites for protection reasons.
5.2.5 Commercial value
Some research institutions aim to exploit research results for commercial purposes. In such institutions, employees’ contracts may include clauses stating that research results (e.g. data) are the property of the institution.
Some publicly funded research institutions and individual projects may also operate on the basis that research results (e.g. data) are to be made available not only for further research but also for commercial exploitation. For example, the European Commission Communication: ‘Towards better access to scientific information: Boosting the benefits of public investments in
ARIADNE D3.3 (Public)
21
research’6 states the importance of making research results available rapidly to benefit European business and industry.
The Digital Michelangelo Project (1997-‐20047) was pioneering both in 3D digitization of masterpieces of the Italian Renaissance, and in the work done by Stanford University and the Italian Ministry of Cultural Heritage (MIBAC), to define the IPR over the data produced, and the rights for dissemination and commercial exploitation. The 3D models produced by the project are available for re-‐use under licence by researchers and scholars on application to Stanford University. Permission for commercial use of the models can be obtained by applying to the Italian government.
5.2.6 Privacy and data protection
Privacy and the protection of personal data is an important issue. There are cases where archaeological research datasets include information that directly or indirectly points towards a specific individual; access to which needs to be restricted under data protection legislation.
5.2.1 National legislation
EU member states have differing national legislation regarding cultural property. In some countries there is legislation that makes all material cultural heritage of a certain age the property of the state. For example, in Greece everything dating to before 1830 and listed monuments (or artefacts) of all dates are the property of the state.
Italian law (law n.42 of 22/01/2004) states: Art. 107 “The Ministry [of Culture], the regions and the other public bodies may allow the reproduction of cultural heritage they have in custody... [at a fee]” Art. 108 “The reproduction fee is fixed by the authority that is the custodian of the object [...]”. No fee is due for reproductions made for personal use, study reasons or valorization (by a public body) whether by private individuals or organizations (including commercial companies). Under this legislation, taking photos of cultural heritage objects (including museum collections) should be allowed on request for the specified uses. Any works that are produced should be licenced under a CC-‐BY-‐NC-‐SA framework, permitting future re-‐use under like conditions and limiting commercial re-‐use.
French legislation distinguishes the dissemination of public data, data produced through a
6http://ec.europa.eu/research/science-‐society/document_library/pdf_06/era-‐communication-‐towards-‐better-‐access-‐to-‐scientific-‐information_en.pdf 7 https://graphics.stanford.edu/projects/mich/
ARIADNE D3.3 (Public)
22
public service mission, from other data. Public data are freely re-‐usable in accordance with the intellectual property (law 17/07/1978). Under the policy of public data's openness on line, the state has created an open and free licence, the open licence Etalab8. It is compatible with any other open licence requiring the minimum to mention paternity.
Public officials assign their rights of reproduction and representation to their administration, cannot object to the disclosure and modification of their work under their public service missions, and maintain restricted moral rights. This is the case of Inrap archaeologists. Researchers and university’s teachers are an exception to this rule: although public officials they maintain all their rights.
It is worth noting there are circumstances under which legislation may require researchers to release data, for example requests under Freedom of Information legislation and Environmental Information Regulations.
The EU Directive on the re-‐use of public sector information (PSI directive 20039 10) has recently been amended to bring public sector libraries (including university libraries), museums and archives within its scope. The Directive looks at the re-‐use of material already public saying it should be available for both commercial and non-‐commercial uses. Charges may apply but the Directive states these should be limited to the “marginal costs of reproduction, provision and dissemination” with exceptions to this rule and on how the costs should be calculated. For archaeological documents held by libraries, museums and archives, these should first be available for re-‐use. The directive allows for exclusive agreements in the case of digitization projects by cultural institutions, which can limit re-‐use for a period of years after the project has been completed [21].
8 http://www.etalab.gouv.fr/ 9Directive 2013/37/EU of the European Parliament and of the Council of 26 June 2013 amending Directive 2003/98/EC on the re-‐use of public sector information. Official Journal of the European Union, L 175/1, 27.6.2013 http://eur-‐lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2013:175:0001:0008:EN:PDF.
10Directive 2003/98/EC of the European Parliament and of the Council of 17 November 2003 on the re-‐use of public sector information. Official Journal of the European Union, L 345/90, 31.12.2003, http://eur-‐lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2003:345:0090:0096:EN:PDF.
ARIADNE D3.3 (Public)
23
5.3 How should the issue of data re-‐use be addressed?
5.3.1 Licensing
While institutions and individuals are subject to the legislative and ethical reasons for limited access to data described above (section 4.2), there is a trend towards the planned release of research data under licence [22]. Some research funders and journals now require that data is deposited in repositories where it can be made available for other researchers to build on. Releasing data is beginning to be seen as being in researcher’s interests:
• Preparing data for release helps ensure that a clear record of how conclusions were reached is preserved
• A culture of openness enables interdisciplinary research and learning from mistakes as well as successes, and
• Has the potential to increase the impact of research academically, economically and socially.
Releasing data under licence protects copyright whilst clarifying the permitted uses. It is important to note that only the rights holder (or someone with permission to act on their behalf) can grant a licence; this means the intellectual property rights (IPR) need to be established before any licensing can take place.
Some data centres have prepared licences that depositors are asked to sign as a condition of deposit, for example both the ADS and KNAW-‐DANS deposit licences [23] [24]. Deposit licences set out the conditions under which the data centres provide access to the data for end-‐users.
Content licences, which may be either bespoke licences prepared for data centres or standard licences, are attached to content items to make the terms and conditions of access and use clear to end-‐users.
The Creative Commons (CC) licensing system is widely used because it offers a series of easy to use, standardised and automated licences that can be attached to content. There are four core stipulations (Attribution (By), Non-‐Commercial (NC), No-‐Derivatives (ND) and Share Alike (SA)) that can be included or excluded to produce seven basic licences:
• The three open licences described in section 3 above: CC0, CC-‐BY and CC-‐BY-‐SA.
• Four more restrictive licences:
o CC-‐BY-‐ND – Attribution No Derivatives -‐ allows for redistribution, commercial and non-‐commercial, as long as the content is not changed and the creator is credited.
ARIADNE D3.3 (Public)
24
o CC-‐BY-‐NC – Attribution Non Commercial -‐ allows others to remix, tweak, and build upon content, as long as the creator of the original content is credited and the new content is not commercial.
o CC-‐BY-‐NC-‐SA – Attribution Non Commercial Share Alike -‐ allows others to remix, tweak, and build upon content non-‐commercially, as long as the creator of the original content is credited and the new content is licenced under the identical terms.
o CC-‐BY-‐NC-‐ ND – Attribution Non Commercial No Derivatives -‐ This licence is the most restrictive of the CC licences. It allows others to download content and share it with others as long as the content is unchanged, the creator of the content is credited you and there is no commercial use.
Taking into account the various conditions of each licence, the licensor grants the user a worldwide, non-‐exclusive, perpetual (for the duration of the applicable right) licence to reproduce, display, perform, communicate and distribute copies of the work. The rights apply to all media and formats known now or subsequently developed (including any modifications technically necessary to exercise the rights in other media formats) [18]. In principle, all rights not expressly granted by the licensor are reserved.
The release of Version 4.0 of Creative Common’s core licence suite on 25 November 2013 provides both a more global licence framework (with official translations and licences that are ready to use without porting) and one that addresses applicable sui generis database rights explicitly. The new version also includes a slight change to reflect accepted practices permitting licensees to satisfy attribution requirements, where specified, with a link to a page for information [25].
5.3.2 Data citation
One of the problems with promoting access and re-‐use of data, is that until recently researchers have not been credited for publishing datasets in the same way as when they publish a research paper. There has been a move to data citation with mechanisms being put in place to allow authors to link journal publications to the underlying datasets [22]. Dataset citations should:
• uniquely identify the object cited, • be able to identify subsets of the data as well as the whole dataset, • provide the reader with enough information to access the dataset, • be readable by humans and also by software tools, so that services can be put in place to use the citations in metrics to support the academic reward system
ARIADNE D3.3 (Public)
25
The elements of a dataset citation are still under debate (see section 3.4 above) but should include details of the creator of the dataset (the author), the date of publication, title, resource type, publisher, identifier and its location.
5.3.3 Should ARIADNE adopt Creative Commons licences for resource discovery metadata?
We asked partners if ARIADNE decided to adopt CC licences for resource discovery metadata, would this would pose any issues for their organization?
INRAP – “We don’t use Creative commons License for the moment but it would not pose any issues for us, provided there is no commercial use (CC-‐NC). As a public institution of research, this is one of the most important criteria”.
SND – “has not adopted the use of CC, but we are looking into it and will give researchers the possibility to put a CC license on their material (data). There are no problems for SND if ARIADNE adopts a CC license on the metadata since most of the metadata at SND is created at/by SND. However some of the abstracts and similar “running text” are taken from (and referred to) reports and similar”.
AIAC – “We have already adopted a Creative Commons Sharealike ((CC BY-‐SA) license for the Fasti Online, and a Non-‐Commercial Sharealike licence (CC BY-‐NC-‐SA)) for the review FOLD&R.”
Other respondents gave their personal opinions, with one individual replying, “I think that all data produced with public money should be public. They were paid with my taxes and I want to own them. CC licensing is a good way to protect them for the community…That said, there is still some way to go before laws, regulations and habits are changed”…“I would also expect that data opening becomes a condition for any public research grant, for the same principle stated above, and since archaeological research requires an excavation permit, this also could be a way to enforce an open, although IPR respectful, licensing scheme, for example CC-‐BY-‐NC-‐SA ...”
Another respondent said “For research data for which rights belong to the archaeological archives/organizations in ARIADNE, it would be great if resource discovery metadata became available according to a Creative Commons License, under the constraints/qualifications noted above. An attribution-‐derivatives-‐non-‐commercial license would sit well with me, and I imagine would resonate well with primary creators/custodians of such data.”
ARIADNE D3.3 (Public)
26
A third respondent commented, “I think that resource discovery metadata does not pose much of a problem however open the license is. Metadata however might include rich descriptive information created with much effort by researchers, archivists, curators or librarians. In such cases Public Domain (CC0 / ODC PDDL / Public Domain Mark) might not seem appropriate for some providers (CC-‐BY / ODC-‐BY probably)”.
This comment prompted another to remark “are we sure that we want CC-‐BY instead of CC0 for metadata? I mean, metadata are used for 'processing', making queries etc. That means that when you produce any kind of result based on such metadata you should be legally obliged to cite all the authors of the metadata involved in the queries...Personally I would prefer the CC0 approach of Europeana” (See [26] for more information on the Europeana approach).
5.4 How should we enhance data awareness and the culture of sharing?
“Data awareness” can be taken to mean awareness amongst researchers that it is important to share data in an open and trustworthy manner.
The “culture of sharing” has varied according to the type of research and the data produced. There are quite well established practices for sharing excavation results (and data sets), but sharing of other types of data is less well established.
The Swedish National Data Service reports that it has noticed a change in data awareness with an increase in numbers of researchers requesting access data. It suggests there are several reasons for this: research funders like the Swedish Research Council are recommending the deposition and sharing of data financed by them; impact from other countries and from other researchers; and also the increased awareness of organizations like SND.
INRAP suggested promoting data sharing on a large scale, crossing national boundaries, in a way that encourages synthesis work would help to raise awareness.
Outreach by data centres such as ADS, SND and KNAW-‐DANS delivering seminars, workshops, training and road-‐shows where they talk about the benefits of sharing data raises awareness and encourages researchers to get in contact about deposition of data.
Incentives to encourage researchers to share their datasets include:
• Establishing and promoting the practice of data citation (as a means of giving academic credit to the data creator).
• The use of persistent identifiers (PIDs) as a means of linking datasets from different sources and making new inquiries into them – and enabling new research.
ARIADNE D3.3 (Public)
27
• Offering tools to make it easier to share, such as simple metadata applications or to enable institutionally held data to be uploaded easily to a central website for archiving.
Providing open access to resource description/discovery metadata as a means for researchers to discover the existence of datasets in repositories and portals is technical mechanism for raising awareness of the data itself.
ARIADNE D3.3 (Public)
28
6 Survey of ARIADNE datasets
A survey was carried out of the datasets, which ARIADNE partners plan to provide for ingestion to the ARIADNE infrastructure. The content and metadata being made available were analysed and the results reported in the initial report on the project registry [27]. The survey also included questions about the rights and access policies in place for the 28 collections being offered for ingestion, the findings of which are analysed in this section. Detailed responses are presented in Appendix 1 below.
6.1 Rights holders
The datasets survey revealed that of the 28 collections proposed for ingestion to the ARIADNE infrastructure, 61% include content from many different research teams and have multiple rights owners. The collections with multiple rights include the holdings of data archives such as the ADS, DAI, DANS, Discovery Programme, MiBAC, MNM-‐NOK etc. 39% of the collections had single rights holders including AIAC’s FOLD&R Journal, INRAP’s collections and the collections of the Austrian Academy of Sciences.
Figure 1: Rights holders in ARIADNE datasets
ARIADNE D3.3 (Public)
29
6.2 Content copyright
It is perhaps not surprising that the survey of ARIADNE partners’ datasets revealed there is copyright in 83% of the collections identified for ingestion. Only 8% of collections were described as “open” with one further collection (3% of the total) being made available under copyleft principles. One partner, KNAW-‐DANS reported that in principle it is possible for researchers to deposit collections with additional restrictions on access or temporary embargos (while research is completed) and these conditions show as affecting 6% of collections in figure 2 below.
Figure 2: Rights in ARIADNE datasets
Several partners whose collections include data deposited by many different researchers reported that copyright, licensing and conditions for use are agreed with individual content owners at the time of deposit.
6.3 Content Access
The access that is currently available to the collections identified in the survey varies. A majority of the collections are available online, with only 3% being offline, and only 5% currently reported as being closed to users (see Figure 3 below). 50% of all the collections are freely available online with a further 39% available online to registered users. One collection (3% of the total) is available online to users after they click-‐through to accept the licence conditions.
ARIADNE D3.3 (Public)
30
Figure 3: Access to datasets
The survey revealed that 42% of the collections identified by partners are made available using standard Creative Commons licences (Figure 4 below). At 22% the most widely used is CC BY NC SA (By Attribution, Non-‐Commercial, Share-‐Alike) with CC BY NC ND (By Attribution, Non Commercial, No-‐Derivatives) being the next most used licence at 14% of collections. 17% of collections are covered by ‘open’ licences with 3% being placed in the Public Domain (CC 0), 3% under CC BY SA, 3% under the French Open licence and 8% reported as being openly licensed.
Figure 4: Content licences in use
ARIADNE D3.3 (Public)
31
47% of the collections identified in the survey are made available to users under terms and conditions that are not defined by standard licences. In general this means that users need to apply to the content holder for permission to use the content for publication or other purposes. Two partners (ADS and KNAW-‐DANS) have developed their own licences. In the case of KNAW-‐DANS, data depositors can choose between the equivalent of a CC BY (By Attribution) licence for open access content, or to restrict access to a certain group and/or certain time (with the possibility of a temporarily embargo for up to two years) (see Appendix 2 below). In the case of ADS, the licence permits the use of data with attribution for research, learning, and teaching, and also for commercial archaeological projects with the provision that the outputs end up in the public domain. Thus the ADS licence is the equivalent of the CC BY NC SA (By Attribution Non-‐Commercial) licence with some specified commercial uses being permitted – see Appendix 3 below. SND enables depositors to specify differing levels of access for their datasets with some sub-‐sets of the collection being available on open access and other sub-‐sets accessible under more restrictive conditions (see Appendix 4 below).
It is also worth noting that different versions of CC licences are in use by partners. Version 3.0 is the most commonly used, however version 2.5 is also used.
6.4 Metadata rights
As part of the survey ARIADNE partners were asked whether metadata was separately available for their content and if so under what licence conditions.
76% of the collections that were identified have metadata available for the content items. The 24% of the collections that lack separate metadata are mostly databases where the records could be considered as metadata, or used to export metadata records if required.
All of the partners were asked if they were able to make the metadata for their content available under a CC0 (Public Domain) licence (see figure 5). Twelve partners replied they were able to make their content metadata available under the CC0 licence representing 60% of the identified collections. Two partners reported they were currently thinking of the CC BY NC SA licence; the Discovery Programme replying that as no separate metadata was available for its databases the content licence was applicable; ZRC SAZU replying it had CC BY NC SA in mind, but was willing to consider CC0 for its metadata if this is important for ARIADNE’s success. Of the three partners who replied they were not
ARIADNE D3.3 (Public)
32
Figure 5: Metadata licences able to make their metadata available under the CC0 licence, INRAP is making a subset of its metadata available under the French Open Data platform and suggested this could be linked to. Athena RC reported a complex rights situation in relation to its clay database and thus said that its metadata could not currently be made available under the CC0 licence. NIAM-‐BAS reported that it needs to determine its strategy and suggested that some elements of its metadata might be made available under a CC0 licence.
6.5 Specific conditions affecting Access
Some partners reported specific factors affecting access to their collections in the survey. For example, Athena-‐RC reported that its clay database contains the results of its analysis of sherds from various excavations. It explained that Athena-‐RC owns the moral rights to the results of its analysis, but access to information about the sherds themselves requires permission from the archaeologists responsible for excavating them. Clearing the rights to accessing this information will involve contacting all the archaeologists involved to obtain their permission.
MNM-‐NOK reported it holds some sensitive datasets, which include information about the locations of sites vulnerable to looting, and that it restricts access to registered users only for this reason.
Some partners are planning to provide databases for ingestion to the ARIADNE infrastructure, which don’t currently have separate metadata. For example, Athena-‐RC said of its clay database that they have difficulty in distinguishing “between metadata and content”. The Discovery Programme similarly reported that the content three of its databases (WODAN, Mapping Death and the Irish Stone Axe project) could be considered as metadata.
ARIADNE D3.3 (Public)
33
7 Discussion
The principle purpose of developing data sharing policies is to help establish best practices in the management of rights and data access by partners in the ARIADNE infrastructure, and the wider archaeological research community. As content partners are responsible for receiving datasets deposited by archaeological researchers, for managing access to those datasets, and will be providing datasets for ingestion to the ARIADNE infrastructure, it is important to consider the whole supply chain (see Figure 6: Data sharing activity chain).
After consulting with partners it is clear that access and sharing policies are evolving. Management of IPR and licensing of content is well established and understood by some partners; others are still working through the process. There are national and institutional variations, and legacy datasets deposited under past frameworks to be taken into consideration. However it is clear there is a common move towards the explicit licensing of content and metadata so that datasets can be made available for research, education and public use.
The activity chain involves the management of rights and data sharing policies at different stages. Some of the key points in the chain are:
7.1 Deposit agreements with content providers
This represents the point when partners in the ARIADNE infrastructure receive deposits of data from archaeological researchers, whether from within their own organization or from external organizations. This is the moment in time when information about the provenance (research team, project) of the dataset and any underlying rights (objects, sites, data re-‐use) is collected and agreements reached for access permissions etc. There is no standard framework, although recommendations can be made on best practices and the adoption of standard licences (the CC licence suite).
7.2 Agreements with ARIADNE
The point when organizations reach agreements with ARIADNE to share their datasets with the research infrastructure is the moment in time when agreements need to be reached about the licensing of resource description metadata and content (for research, education, public and/or commercial use), permissions for data re-‐use (making derivatives), and data citation (accreditation) etc.
ARIADNE D3.3 (Public)
34
7.3 Data sharing and access
This is the framework under which users access datasets via the ARIADNE infrastructure, and covers policies for data citation, provision of unique persistent identifiers for datasets (and subsets), and licences for resource description metadata and content.
Figure 6: Data sharing activity chain
Collection description, DOI and license framework
Archaeological researcher
Access agreement
Content partner
Objects and sites provenance, accreditation, assets, IPR
Deposit agreement
ARIADNE infrastructure
Metadata, content, public use, derivatives,
commercial use, citation
Portal and search engine
ARIADNE D3.3 (Public)
35
7.4 Licence framework
The most widely adopted legal framework being used by partners to manage access and sharing of data is the Creative Commons suite of licences, and therefore this is likely to provide the most suitable framework for ARIADNE. The main issues are discussed below.
7.4.1 Resource description/Collection description metadata
Such metadata is used to provide for the identification (discovery) of collections, sub-‐collections and/or individual content items within data repositories. Although the partner survey showed this type of metadata is not currently available for all ARIADNE datasets, it could be generated (at least at collection level). The survey revealed there is general consensus amongst partners that open access should be provided to this type of metadata. The main area of difference was whether to follow the Europeana model and adopt the CC0 (public domain) licence or the CC BY licence (to ensure attribution of the content provider).
7.4.2 Content licensing
The responsibility for negotiating and agreeing which permissions are to be licensed by archaeological researchers for their content lies with content partners. ARIADNE is able to suggest best practices, such as the use of the Creative Commons licence suite. The main issues to be considered are:
• The Attribution condition could be problematic if data are to be combined with data from a large number of other sets due to the administrative burden of crediting each individual contributor in the manner of their choosing [22].
• The Share Alike condition can cause problems, as it requires the licensee to release any derived dataset under the same licence and thus prevents it from being combined with data released under a different licence. This is true even within Creative Commons: a derived dataset cannot contain both CC BY-‐SA-‐licensed data and CC BY-‐NC-‐SA-‐licensed data.
• The No Derivatives condition may restrict data reuse – it requires that data is used ‘as-‐is’, although precisely what this means in practice is a matter of debate [22].
• The Non-‐Commercial condition would not cause any problems for ARIADNE with regard to combining data, but it may have wider implications, as what constitutes commercial use is ambiguous. Depending on interpretation, the NC condition may or
ARIADNE D3.3 (Public)
36
may not prevent data from being used in works for which an author is paid (such as textbooks), in works that are sold (such as journal articles).
Several partners confirmed their support for the principle of open access to research data in the partner survey. However, the use of the NC condition (or limitation of access to education, research or public uses only) means that many of the licence agreements in place do not fully conform to the definition of open data given by OpenAIRE or the Open Definition [6] [7]: “a piece of data or content is open if anyone is free to use, reuse, and redistribute it — subject only, at most, to the requirement to attribute and/or share-‐alike”.
The reasons for using the non-‐commercial licence clause are not entirely clear. It may be that the content has potential commercial value and is being offered under a multiple licencing regime (this strategy was adopted for the Digital Michelangelo project). Such a strategy allows data to be distributed under both a copyleft licence and under an alternate licence on payment of a fee for commercial uses [22].
Finally, the partner datasets survey showed that various different versions of CC licences are currently being used. It would be useful to find out whether it is possible (and useful) to port existing licences to the newly released version 4.0, which is said to be more user-‐friendly. The CC version 4.0 suite should be used for licensing of databases, as this is the only version which explicitly covers sui generis database rights [25]. Research by the OpenAIRE project also endorses the use of CC 4.0 licences for scientific datasets for the same reason [28].
ARIADNE D3.3 (Public)
37
8 Recommendations
The surveys and desk-‐top research carried out have identified various elements, which ARIADNE is recommended to include in its data sharing policy framework:
1. A common method of data citation should be established for adoption by partners and promotion by ARIADNE to the archaeological research community. Academic recognition is an important motivation for encouraging researchers to share access to their datasets.
2. Allocation of DOIs or the equivalent to datasets ingested to the ARIADNE infrastructure should be investigated. The system used should be capable of identifying sub-‐sets within collections. Persistent identification of datasets is important in underpinning data sharing and data citation.
3. The Creative Commons licence suite (version 4.0 is preferred) should be used for content (databases, document archives, images, 3D models, etc.) provided to ARIADNE by content partners under licence permissions agreed with the content owner. CC BY is recommended for open access. CC BY SA or CC BY SA NC licences may also be applicable.
4. It is recommended that together with the content itself, partners be requested to provide:
• A collection description (of the whole collection and sub-‐sets within the collection) published under a CC BY licence for each dataset ingested into the ARIADNE infrastructure. Collection description is a useful way of capturing the provenance and contextual information about data collections, and can be used to underpin data citation.
• Item level metadata records should be published under a CC0 licence – to enable integration of multiple datasets within the metadata repository, support resource discovery and enable Linked Open Data. As ARIADNE will be ingesting multiple datasets from different content providers under differing existing licence conditions, it is recommended that ARIADNE follows the example of Europeana, and defines a metadata element set that can be published under an open licence (CC0 is the most open, CC BY if public domain licensing cannot be agreed upon).
ARIADNE D3.3 (Public)
38
9 References
[1] Austin, T. & Mitcham, J.: Preservation and Management Strategies for Exceptionally Large Data Formats: ‘Big Data’. ADS & English Heritage, 2007. http://archaeologydataservice.ac.uk/attach/bigData/bigdata_final_report_1.3.pdf
[2] Swan, A.: Sharing Knowledge: Open Access and Preservation in Europe, Conclusions of a strategic workshop, European Commission, Brussels, 2010. http://ec.europa.eu/research/science-‐society/document_library/pdf_06/oa-‐preservation-‐2011_en.pdf
[3] European Commission: Report of the European Commission Public Consultation on Open Research Data, 2013, http://ec.europa.eu/research/science-‐society/document_library/pdf_06/report_2013-‐07-‐open_research_data-‐consultation.pdf
[4] ARIADNE, 2013, D3.2 Report on project standards.
[5] Open Access Max Planck Gesellschaft, 2003, Berlin Declaration on Open Access to Knowledge in the Sciences and Humanities, http://openaccess.mpg.de/286432/Berlin-‐Declaration
[6] OpenAIRE: the Open Access Infrastructure for Research in Europe, 2013, Open Access Overview, website: http://www.openaire.eu/en/open-‐access/open-‐access-‐overview (accessed 1/1/2014)
[7] Open Definition, 2013, website: http://opendefinition.org/ (accessed 20/12/2013)
[8] Open Access Spectrum, 2013, How Open Is It?, online: http://www.plos.org/about/open-‐access/howopenisit/ (accessed 20/12/2013)
[9] Berners-‐Lee, Tim, 2006 with additions in 2010, Design Issues: Linked Data, discussion document online: http://www.w3.org/DesignIssues/LinkedData.html (accessed 20/12/2013)
[10] 5 * Open Data, 2012, website: http://5stardata.info/ (accessed 20/12/2013)
[11] LODLAM: International Linked Open Data in Libraries Archives and Museums summit, 2012, website online: http://lod-‐lam.net/summit/2011/06/06/proposed-‐a-‐4-‐star-‐classification-‐scheme-‐for-‐linked-‐open-‐cultural-‐metadata/ (Accessed 21/12/2013)
[12] Hardman, C. 2013, The Archaeology Data Service: Data Preservation and persistent identifiers in UK archaeology’, ODIN codesprint and first year conference, October 2013,
ARIADNE D3.3 (Public)
39
http://indico.cern.ch/getFile.py/access?contribId=19&sessionId=19&resId=1&materialId=slides&confId=238868 (accessed 21/12/2013)
[13] Ball, A. and Duke, M., 2012, How to Cite Datasets and Link to Publications, Digital Curation Centre, http://www.dcc.ac.uk/resources/how-‐guides/cite-‐datasets (accessed 30/12/2013)
[14] DataCite, 2013, website: http://www.datacite.org/ (Accessed 21/12/2013)
[15] International DOI foundation, 2013, website: http://www.doi.org/ (accessed 21/12/2013)
[16] Creative Commons, 2013, website: http://creativecommons.org/ (accessed 21/12/2013)
[17] Open Knowledge Foundation, 2013, website: http://okfn.org/ (accessed 28/12/2013)
[18] Guibault, Lucie (2013) Licensing Research Data under Open Access Conditions. Chapter to be published in: D. Beldiman (ed.), Information and Knowledge: 21st Century Challenges in Intellectual Property and Knowledge Governance, Cheltenham, Edward Elgar, upcoming 2013, http://www.ivir.nl/publications/guibault/Open_Research_Data.pdf (accessed 27/12/2013)
[19] Christian G.E (2009) Building a sustainable framework for open access to research data through information and communications technologies. International Development Research Centre, Canada, December 2009, http://idl-‐bnc.idrc.ca/dspace/bitstream/10625/41336/1/129183.pdf (accessed 27/12/2013)
[20] Hugenholtz, Bernt (2005) Abuse of Database Right. Sole-‐source information banks under the EU Database Directive. In: F. Lévêque & H. Shelanski (eds.) Antitrust, patents and copyright: EU and US perspectives, Cheltenham: Elgar 2005, pp. 203-‐219, http://www.ivir.nl/publications/hugenholtz/abuseofdatabaseright.pdf (accessed 27/12/2013)
[21] Zijlstra, T. and Janssen, K., 2013, The new PSI directive – as good as it seems? Open Knowledge Foundation blog post, April 19, 2013: http://blog.okfn.org/2013/04/19/the-‐new-‐psi-‐directive-‐as-‐good-‐as-‐it-‐seems/ (accessed 30/12/2013)
[22] Ball, A, 2012, How to License Research Data, Digital Curation Centre, http://www.dcc.ac.uk/resources/how-‐guides/license-‐research-‐data (accessed 30/12/2013)
[23] ADS, 2013, ADS deposit license, URL: www.ahds.ac.uk/documents/ahds-‐archaeology-‐licence-‐form.doc (accessed 30/12/2013)
ARIADNE D3.3 (Public)
40
[24] DANS, 2013, DANS license on deposited data http://www.dans.knaw.nl/en/content/dans-‐licence-‐agreement-‐deposited-‐data (accessed 30/12/2013)
[25] Creative Commons, 2013, What’s new in 4.0, https://creativecommons.org/version4 (accessed 2/1/2014)
[26] Europeana, 2012, Data Exchange Agreement, online explanation: http://pro.europeana.eu/web/guest/data-‐exchange-‐agreement (accessed 1/1/2014)
[27] ARIADNE, 2013, D3.1 Initial report on project registry
[28] Guibault, Lucie & Wiebe, Andreas (eds., 2013): Safe to be Open: Study on the protection of research data and recommendation for access and usage. University of Göttingen Press, http://webdoc.sub.gwdg.de/univerlag/2013/legalstudy.pdf (accessed 27/1/2014)
ARIADNE D3.3 (Public)
41
Glossary
ARK Archival Resource Key is a Uniform Resource Locator (URL) that is a multi-‐purpose identifier for information objects of any type.
BY By attribution
CC Creative Commons
DOI Digital Object Identifier is a character string (a “digital identifier”) used to uniquely identify an object such as an electronic document. Metadata about the object is stored with the DOI name and a URI or URL, where the object can be found. The DOI for a document is permanent, whereas its location and other metadata may change. The DOI system is implemented through a federation of registration agencies coordinated by the International DOI Foundation, which developed and controls the system. Organisations, such as ADS and KNAW-‐DANS, who meet the requirements of the Foundation can pay to join the system and allocate DOIs.
Handle In computer programming, a handle is an abstract reference to a resource.
HTTP HyperText Transfer Protocol (HTTP) is an application protocol for distributed, collaborative, hypermedia information systems.
IT Information Technology
ODC Open Data Commons
PURL Persistent Uniform Resource Locator is a uniform resource locator (URL) (i.e. location-‐based uniform resource identifier or URI) that is used to redirect to the location of the requested web resource.
SA Share Alike
UNF Unique Numeric Fingerprint is a cryptographic hash of the data, which is used in citations to ensure that no change has occurred to the data since it was cited.
URI Uniform Resource Identifier is a string of characters used to identify a name of a web resource.
ARIADNE D3.3 (Public)
42
Appendix 1: Ariadne questionnaire on datasets, metadata and data sharing policies
RIGHTS AND ACCESS
Rights holder(s) -‐ The owner(s) of the rights of the content being provided
ADS datasets (Archsearch, Grey literature reports, individual archives, Linked Data endpoint) -‐ Archaeological data are the product of many years of scholarship by numerous archaeologists, collectors, analysts, antiquarians, field workers and laboratory scientists. It is not possible to credit all these scholars individually, or sometimes even identify them, but in making their data available for re-‐use, the ADS acknowledges their contribution. At all times, the ADS seeks to protect the intellectual property rights and copyright of the originators of data where that can reasonably be achieved. The catalogue also includes links to other service providers. It is the responsibility of users to acknowledge and comply with the copyright conditions that may be imposed by other service providers.
AIAC – FASTI Online – AIAC is the copyright holder of data contained on FASTI with specific attribution and permissions for use of multimedia granted by project summary authors.
AIAC – FASTI Online FOLD&R Journal – AIAC is the copyright holder for journal publications.
ARHEO – Survey data of archaeological sites, Geophysical data, Analysis of ceramics from excavations -‐ The content in ARHEO is accessible online (http://arheovest.com/fildsurvey.html). The data itself has conditions and license agreements.
ARUP-‐CAS – Archaeological map of the Czech Republic -‐ Institute of Archaeology ASCR, Prague, v.v.i.; Institute of Archaeological Heritage Care of Central Bohemia; Museum of The Bohemian Paradise, Turnov.
Athena-‐RC – CETI – Clay database -‐ The owner(s) of the rights of the content being provided + Athena R.C. “the sherds we have analyzed are from various excavations. “Athena” RC does not own the sherds. However, since we are making the measurements the results are our “property”, so we own the rights for them.”
Cyi_STARC – Archaeological collections -‐ The content in STARC-‐Repo is free for use and open access. The data itself has conditions and license agreements. Depositors own the data even if the dataset is open access. The Depositors are in this case the following Institutions: Department of Antiquities Cyprus, University of Sydney, Byzantine Museum and Art Gallery of the Archbishop Makarios III Foundation, Mediterranean Archaeological Research Institute-‐Vrije Universiteit Brussel, The Cyprus Folk Art Museum and The A.G. Leventis Foundation.
DAI – iDAI.images, Arachne – Several rights holders.
ARIADNE D3.3 (Public)
43
DANS eDepot for Dutch Archaeology (EDNA) -‐ Metadata (the content of all fields under the “Description” tab in every dataset in EASY) is free for use and open access. However the data itself has conditions for use and license agreements. Depositors own the data, even if a dataset is open access, the data can only be used for personal use.
DANS Digital Collaboratory for Cultural Dendrochronology (DCCD) -‐ Individual copyrights of data (some top level metadata free open access).
Discovery Programme – WODAN -‐ The content of the database is are provided by all the palaeo-‐environemntal specialists in Ireland, therefore each record-‐set is their copyright, however the principles of the resource is that to be able to store your content using WODAN you must enable your data to used under CC.
Discovery Programme – Mapping Death -‐ The content of the database is provided by all the Discovery Programme. Some associated media which may have been provided by commercial consultants and phD candidates e.g. lab reports and excavation documents will be there copyright.
Discovery Programme – Irish Stone Axe Project (ISAP) Database -‐ The content of the database is are provided by all the University College Dublin (UCD).
Discovery Programme – SHARE-‐IT (Spatial Heritage Archaeological Research Environment) -‐ Content providers: Discovery Programme, UCD (selected), NUI Galway (selected) and some commercial companies.
INRAP -‐ Archeozoom database -‐ Open data: the owner of the rights for the geolocation database is Inrap; the owner of the rights for the content database is Inrap; the owners of the rights for the editorial content are Inrap and the authors. All editorial contents are subject to copyright from the authors.
INRAP – Dolia -‐ The owner of the rights for the database is Inrap. Each document inside the database is subject to copyright from to the authors, including the PDF documents but also some parts of the bibliographical records and esp. the abstract of the document.
INRAP – Iconothèque – Images d’Archéologie (IDA) -‐ The owner of the rights for the database is Inrap. Each photo or video document is subject to copyright by the authors. All editorial contents are subject to copyright from the authors. Inrap is the owner of the exploitation rights for the photographs and granted the Réunion des Musées Nationaux Photo Agency (public institution) the commercial exploitation of hi-‐def photographs.
MIBAC-‐ICCU: SITAR -‐ Ministero per i Beni e le Attività Culturali e il Turismo -‐ Soprintendenza Speciale per i Beni Archeologici di Roma for all SITAR GeoDB dataset/records and archive documents directly owned by SSBAR. For any external Archives documents possibly stored in SITAR web file system (e.g. public cartographic bases, historical document, etc.) the owner institution / natural person copyright specifications apply.
MIBAC-‐ICCU: CulturaItalia -‐ the data’s right belong to the CulturaItalia content provider.
ARIADNE D3.3 (Public)
44
MNM-‐NOK – Archaeological excavation datasets -‐ In Hungary archaeologists are given five years to assess their excavated materials. Therefore, the archaeological dataset could be made public only when the five years are over for a given excavation. After that the data’s right belong to the project partner. Datasets of sites and scientific analyses belong to the project partner without time restriction.
NIAM-‐BAS: Archaeological Map of Bulgaria -‐ The respective organizations and people responsible for the particular entry. (over 50 organizations).
OEAW -‐ Franzhausen_Kokoron -‐ Austrian Academy of Sciences (M. Lochner).
OEAW -‐ UK_Material-‐POOL -‐ Austrian Academy of Sciences (M. Lochner).
OEAW -‐ Fundprotokoll_Thunau_Scan -‐ Austrian Academy of Sciences.
SND -‐ GIS data -‐ The metadata (the content of all fields in the SND catalogue) is free for use and open access. The data itself has conditions and license agreements. Depositors (in this case University of Uppsala) own the data even if the dataset is open access.
ZRC SAZU – ARKAS and ZBIVA -‐ Inštitut za arheologijo ZRC SAZU.
Content copyright
Describe the copyright of the content in the dataset being provided – is there a standard rights framework (e.g. Creative Commons licences), or many individual copyrights?
ADS datasets (Archsearch, Grey literature reports, individual archives, Linked Data endpoint) -‐ All material on this web server is protected by copyright. Specific copyright holders are identified in the appropriate pages and sections. In all other instances, copyright is retained by The University of York on behalf of the Archaeology Data Service (ADS). Links to materials on other servers should not be construed as a claim over them. All rights reserved. A non-‐exclusive, non-‐transferable licence is hereby granted to those using or reproducing, in whole or in part, the material for valid not-‐for-‐profit teaching and research purposes, providing the copyright owners are acknowledged. The Archaeology Data Service should be cited as the source of the material from this server. Where specific permission to use material is required, this is identified and such permission must be sought from the copyright holder or agency cited. Anyone wishing to use the catalogue for any other purpose must contact the ADS to seek permission.
AIAC – FASTI Online – all data on FASTI is licenced as Creative Commons Attribute-‐ShareAlike 3.0 Unported (CC BY-‐SA 3.0).
AIAC – FASTI Online FOLD&R Journal – all FOLD&R publications are licenced as Creative Commons Attribute Non-‐Commercial ShareAlike 3.0 Unported (CC BY-‐NC-‐SA 3.0)
ARHEO – Survey data of archaeological sites, Geophysical data, Analysis of ceramics from excavations – Open.
ARIADNE D3.3 (Public)
45
ARUP-‐CAS – Archaeological map of the Czech Republic -‐ Three organizations (Institute of Archaeology ASCR, Prague, v.v.i.; Institute of Archaeological Heritage Care of Central Bohemia; Museum of The Bohemian Paradise, Turnov) are the owners of the photographs but their contents is covered by the individual author´s property rights. In practice, the organization may display the photographs but their use (in publications, etc.) should be allowed by the author(s).
Athena-‐RC – CETI – Clay database -‐ No standard rights framework.
Cyi_STARC – Archaeological collections -‐ For each dataset the content copyrights depends on the copyright agreement of CyI-‐STARC with the Depositors.
DAI – iDAI.images, Arachne – The whole work is licenced by a Creative Commons licence (CC BY NC ND) but this implies only for the free accessible content of the Arachne Web Site. However there are different appointments for other content, which is related to the Arachne Datasets, but not freely accessible (e.g. high resolution images).
For more information: http://arachne.uni-‐koeln.de/drupal/?q=node/11
DANS eDepot for Dutch Archaeology (EDNA) -‐ Individual copyrights of data (metadata free open access).
DANS Digital Collaboratory for Cultural Dendrochronology (DCCD) -‐ Individual copyrights of data (some top level metadata free open access).
Discovery – WODAN -‐ As the database stores content form many experts the license ultimately remains a choice of theirs. However, all experts have been advised to be open with their data and adopt CC Attribution-‐NonCommercial-‐ShareAlike 3.0 Unported.
Discovery – Mapping Death -‐ Data within Mapping Death (excluding associated media files) is licensed as CC Attribution-‐NonCommercial-‐ShareAlike 3.0 Unported.
Discovery – Irish Stone Axe Project (ISAP) Database – Data within ISAP Database (excluding associated image files) is licensed as CC Attribution-‐NonCommercial-‐ShareAlike 3.0.
Discovery – SHARE-‐IT (Spatial Heritage Archaeological Research Environment) -‐ Data within SHARE-‐IT is available as CC Attribution-‐NonCommercial-‐ShareAlike 3.0 Unported.
INRAP -‐ Archeozoom database -‐ The applicable rights framework is the French Code de la Propriété Intellectuelle (Intellectual Property Law), for each individual copyright owner. The geolocation and content databases are licensed under “Open Licence” (http://www.data.gouv.fr/Licence-‐Ouverte-‐Open-‐Licence). All editorial contents provided by Inrap on their online platforms are licensed under a “Creative Commons” Attribution-‐NonCommercial-‐NoDerivs 3.0 Unported License.
INRAP – Dolia -‐ There is no standard rights framework. The applicable rights framework is the French Code de la Propriété Intellectuelle (Intellectual Property Law), for each individual copyright owner.
ARIADNE D3.3 (Public)
46
INRAP – Iconothèque – Images d’Archéologie (IDA) -‐ The applicable rights framework is the French Code de la Propriété Intellectuelle (Intellectual Property Law), for each individual copyright owner (high-‐resolution images). All editorial contents provided by Inrap are licensed under a “Creative Commons” Attribution-‐NonCommercial-‐NoDerivs 3.0 Unported License.
MIBAC-‐ICCU: SITAR GeoDB specific datasets (OI, PA, UA, DT, digital objects, etc. owned by SSBAR in its archives) the following copyright/copyleft approaches are expected:
• CC-‐0 license for SITAR primary source open data structured such as in e.g. Wolfram | Alpha (http://www.wolframalpha.com/about.html) with concept of primary source (public data consolidated in its own databases) vs. external sources (on line and other public / not public datasets and scientific/statistical data); could be available also for open data and linked Open Data applications; such as in Europeana;
• CC-‐BY-‐SA-‐NC for SITAR GeoDB dataset, single record and/or digital object available on line/off line in for scientific and territorial studies;
• digital copies of external archives documents that may be stored in the SITAR web file system (e.g. public cartographic bases, historical document, etc.) in compliance with owner institution / natural person copyright specifications.
MIBAC-‐ICCU: CulturaItalia -‐ There are many individual copyrights that depend on the content provider’s policy.
MNM-‐NOK – Archaeological excavation datasets -‐ There are many individual copyrights, the leaders of individual excavations have limited copyrights on the documentation (see above). It is best to contact them for further use of the material.
NIAM-‐BAS: Archaeological Map of Bulgaria -‐ Varied or undetermined. An effort should be made to bring all the data under a common copyright status.
OEAW -‐ Franzhausen_Kokoron -‐ Copyright by Österreichische Akademie der Wissenschaften, Wien.
OEAW -‐ UK_Material-‐POOL -‐ Copyright by Österreichische Akademie der Wissenschaften, Wien.
OEAW -‐ Fundprotokoll_Thunau_Scan -‐
SND -‐ GIS data – no reply.
ZRC SAZU – ARKAS and ZBIVA -‐ CC Noncommercial, Share Alike (CC BY-‐NC-‐SA 2.5 SI).
ARIADNE D3.3 (Public)
47
Content Access rights
Describe the access rights to use the content in the ARIADNE project, e.g.: public access, restricted to registered users, open to consortium members for research, etc.
ADS datasets (Archsearch, Grey literature reports, individual archives, Linked Data endpoint) -‐ The Archaeology Data Service (ADS) levies no charge to use its collections when used according to the terms detailed below. However, by accepting the terms of the Common Access Agreement users are entering into a legally binding agreement. The Access Agreement covers a range of services, including both on-‐line and paper transactions, this preamble seeks to explain how it is implemented by the ADS (the ADS also formerly hosted AHDS Archaeology). It is the aim of the ADS to provide integrated access to a variety of data sets at no cost to end users. This aim can only be achieved if the organisations which supply data can be assured that their own rights are protected. Consequently, use of the catalogue is covered by a variety of legal instruments to protect us, our users, and those who supply data to us.
The Common Access Agreement asks that users be fair and reasonable in their use of the data supplied through the ADS. The ADS levies no charges, there are no documents to sign, and none will be applied retrospectively. We do not hold information on specific users. In return we ask that you acknowledge the source and copyright of the data you use, that you tell us of any errors you find in it, and that if you undertake any work based substantially upon it, that you tell us about it and send us a copy of any subsequent publication. The data must not be sold or supplied to a third party.
The data should only be used for teaching, learning, and research purposes. By teaching, we mean directed teaching undertaken with a designated tutor in a formal setting. By learning we mean self-‐directed study, whether or not attached to an educational institution, including the educational activities undertaken in museums, libraries and cognate institutions. By research we mean any work undertaken for the advancement of archaeological knowledge and/or the understanding of the historic environment. Such work may be commercially sponsored or it may be funded by academic bodies or learned societies, or it may be unsupported: but it is a condition of use that the results are placed in the public domain and are made freely available for others to use according to the normal principles of professional and academic practice.
AIAC – FASTI Online – all FASTI data is free and open access through FASTI Online.
AIAC – FASTI Online FOLD&R Journal – all FOLD&R publications are provided free and open access as hosted on FASTI Online.
ARHEO – Survey data of archaeological sites, Geophysical data, Analysis of ceramics from excavations – Public access.
ARUP-‐CAS – Archaeological map of the Czech Republic -‐ The access to the data must be structured. It is assumed the on-‐line database will be generally accessible by the public. A certain part of the snaps will be available only to registered users and another part will be closed at all, to protect archaeological sites from robbing, mainly by illegal detector users.
ARIADNE D3.3 (Public)
48
Athena-‐RC – CETI – Clay database -‐ Currently restricted, external users do not have direct access to the content, partial data are available on a project basis.
Cyi_STARC – Archaeological collections – Public access.
DAI – iDAI.images, Arachne – There are 1,4 millions images visible for everyone out of 1,7 millions images overall. For the rest there are various forms of restrictions. High-‐resolution images can only be accessed after registration and for non-‐commercial use.
DANS eDepot for Dutch Archaeology (EDNA) -‐ Metadata is open access without user registration. To get access to the data, user registration in EASY is necessary. Categories are: “Open access”, “Restricted access” or “Other access”.
DANS Digital Collaboratory for Cultural Dendrochronology (DCCD) -‐ The metadata which is free for use and open access is visible to any user who is not logged in. To get access to deeper information levels, user registration is necessary and different levels of access are applicable. For registered users research results such as wood species and absolute calendar dates are 100% open access (courtesy of the data owners, and in NL according to KNA specifications, i.e. with a lag of two years). 15% of dendrochronological time-‐series and associated files are fully open access (i.e. can be downloaded without contacting the authors); this percentage is expected to increase when the DCCD becomes linked to other data networks.
Discovery – WODAN -‐ Currently access to WODAN data is through a login procedure; however this maybe altered to provide open access to all.
Discovery – Mapping Death – Currently access to Mapping Death data; excluding associated media from commercial sources, is available freely for all users.
Discovery – Irish Stone Axe Project (ISAP) Database – Currently access to ISAP data; excluding image data is freely available for all users.
Discovery – SHARE-‐IT (Spatial Heritage Archaeological Research Environment) – Currently access to SHARE-‐IT is freely available for all users.
INRAP -‐ Archeozoom database –
• Geolocation and content databases is Open to consortium members for research; Downloading and editing allowed
• Editorial content is Public access; Display only; Downloading and editing the contents is not allowed.
INRAP – Dolia -‐ Three levels of access are available: general public access, access for external registered users, internal access for Inrap contributors. The PDF documents are not accessible when using the general public access, and the external users have access to a reduced set of data compared to the internal users. The members of ARIADNE project may be granted the access for registered users.
ARIADNE D3.3 (Public)
49
INRAP – Iconothèque – Images d’Archéologie (IDA) -‐ Editorial content: open to consortium members for research. The videos contents are accessible through streaming services and can’t be downloaded or edited. High-‐resolution images cannot be downloaded but can be linked to Inrap’s image database. Low-‐res images can be easily downloaded.
MIBAC-‐ICCU: SITAR -‐ An integration with IDEM (digital Identity framework supported in Italy by Consortium GARR) policies and on line resources federation (see at http://www.idem.garr.it) is expected in middle term to support data accesses and users’ experiences with SITAR apps and tools.
For the external Archives documents stored in SITAR web file system (e.g. public cartographic bases, historical document, etc.) access rights corresponds to the policies of the owner institution / natural person specifications.
MNM-‐NOK – Archaeological excavation datasets -‐ Datasets could be sensitive (e.g. due to looting) so it should be restricted to registered users only.
NIAM-‐BAS: Archaeological Map of Bulgaria -‐ Currently it is managed on personal and organization level (over 50 organizations) plus access that can be granted to a person or an organization to all sites/objects registered in certain region. AIS AKB is protected by an issue by the Bulgarian Ministry of Culture released in 2011 that strictly defined its structure, support and limited access. The dataset has a limited access protected by an issue by the Bulgarian Ministry of Culture released in 2011, so we are in the process of preparing an open access version with much less information.
OEAW -‐ Franzhausen_Kokoron -‐ open for research within Ariadne, public access has to be clarified.
OEAW -‐ UK_Material-‐POOL -‐ open for research within Ariadne, public access has to be clarified.
SND -‐ GIS data – In Sweden, material produced in a governmental institution (as a university) is basically regarded (with few exceptions) as open to the public, i.e. anyone can ask for the material without telling their names or the purpose for the request. However, at the same time, the copyright law in Sweden follows more or less the Bern convention (moral and economical rights). SND have in our web catalogue made the archaeological material freely available for anyone to download without registration. SND is not owner of the data and what we do with the data is regulated by a contract. This contract states that SND shall make the data freely available for anyone without registration. The Primary Investigator has nothing against that the data is used by the ARIADNE consortium for development of tools etc. The same goes for anyone anywhere finding the data via any search system etc. that ARIADNE develops. However, the primary investigator would prefer that the data is held at SND and is/will be downloaded via the SND web catalogue. The data material will be given DOI’s in a near future.
ZRC SAZU – ARKAS and ZBIVA – no reply.
ARIADNE D3.3 (Public)
50
Use of standard licences
If you have a standard access or licence agreement for your content, please provide a link/copy. Can you provide a copy of your access/licence agreement in English?
ADS datasets (Archsearch, Grey literature reports, individual archives, Linked Data endpoint) –
AIAC – FASTI Online – CC BY-‐SA 3.0
AIAC – FASTI Online FOLD&R Journal – CC BY-‐NC-‐SA 3.0
ARHEO – Survey data of archaeological sites, Geophysical data, Analysis of ceramics from excavations – not available.
ARUP-‐CAS – Archaeological map of the Czech Republic – Not available
Athena-‐RC – CETI – Clay database – Not applicable.
Cyi_STARC – Archaeological collections – no standard licence.
DAI – iDAI.images, Arachne – http://creativecommons.org/licenses/by-‐nc-‐nd/3.0/
DANS eDepot for Dutch Archaeology (EDNA) -‐ http://www.dans.knaw.nl/en/content/data-‐archive/legal-‐information.
DANS Digital Collaboratory for Cultural Dendrochronology (DCCD) -‐ http://dendro.dans.knaw.nl/termsofuse.
Discovery – WODAN -‐ Currently no license agreement exists.
Discovery – Mapping Death – Currently no license agreement exists.
Discovery – Irish Stone Axe Project (ISAP) Database – Currently no license agreement exists.
Discovery – SHARE-‐IT (Spatial Heritage Archaeological Research Environment) – Currently no license agreement exists.
INRAP -‐ Archeozoom database –
Databases (open content): Geolocation and content databases will soon be opened through the French Open Data platform http://data.gouv.fr, under a public licence http://www.data.gouv.fr/Licence-‐Ouverte-‐Open-‐Licence
Open Licence (english version): http://ddata.over-‐blog.com/xxxyyy/4/37/99/26/licence/Licence-‐Ouverte-‐Open-‐Licence-‐ENG.pdf
ARIADNE D3.3 (Public)
51
Editorial content (open access):. All editorial contents provided by Inrap on their online platforms are licensed under a Creative Commons Attribution-‐NonCommercial-‐NoDerivs 3.0 Unported License
INRAP – Dolia -‐ Not applicable for the whole content. See above for licence issues regarding metadata.
INRAP – Iconothèque – Images d’Archéologie (IDA) -‐ content databases will soon be opened through the French Open Data platform http://data.gouv.fr, under a public licence: http://www.data.gouv.fr/Licence-‐Ouverte-‐Open-‐Licence. Open Licence (english version): http://ddata.over-‐blog.com/xxxyyy/4/37/99/26/licence/Licence-‐Ouverte-‐Open-‐Licence-‐ENG.pdf
The low-‐res images and their caption may be used freely for a non-‐commercial use. The terms of use page is in French only: http://images-‐archeologie.fr/Accueil/Pages-‐annexes/Conditions-‐d-‐utilisation/p-‐32-‐Conditions-‐generales-‐d-‐utilisation.htm and licensed under a Creative Commons Attribution-‐NonCommercial-‐NoDerivs 3.0 Unported License.
MIBAC-‐ICCU: SITAR: • CC-‐0 license: http://creativecommons.org/publicdomain/zero/1.0/ • CC-‐BY-‐SA-‐NC: http://creativecommons.org/licenses/by-‐nc-‐sa/3.0/legalcode • For the external archive documents that may be stored in SITAR web file system (e.g. public
cartographic bases, historical document, etc.) the owning institution/legal person supplies the corresponding data access & use licenses.
MIBAC-‐ICCU: CulturaItalia -‐ CulturaItalia has two different type of agreement based on
• CC 2.5 http://www.culturaitalia.it/opencms/export/sites/culturaitalia/attachments/ linked_open_data/Licenza_CulturaItalia_CC_2_5.pdf
• CC0 http://www.culturaitalia.it/opencms/export/sites/culturaitalia/attachments/ linked_open_data/Licenza_CulturaItalia_CC0.pdf
MNM-‐NOK – Archaeological excavation datasets -‐ We do not have a standard access or licence agreement.
NIAM-‐BAS: Archaeological Map of Bulgaria -‐ We do not have a standard access or licence agreement.
OEAW -‐ Franzhausen_Kokoron -‐ http://epub.oeaw.ac.at/franzhausen-‐kokoron2/Impressum.htm.
OEAW -‐ UK_Material-‐POOL – no reply.
SND -‐ GIS data – Accessibility levels at SND: http://snd.gu.se/en/search-‐and-‐order-‐data/accessibility-‐levels. For the GIS-‐material the access level is set to 1a.
ZRC SAZU – ARKAS and ZBIVA -‐ http://creativecommons.org/licenses/by-‐nc-‐sa/3.0/deed.sl.
ARIADNE D3.3 (Public)
52
Metadata rights
Can metadata be made available to ARIADNE under CC0 or a Public Domain license? ADS datasets (Archsearch, Grey literature reports, individual archives, Linked Data endpoint) – All of our metadata can be made available under CC0.
AIAC – FASTI Online – Yes. We already provide RDF data for FASTI place references to the Pelagios project licensed under a Public Domain licence.
AIAC – FASTI Online FOLD&R Journal – Yes.
ARHEO – Survey data of archaeological sites, Geophysical data, Analysis of ceramics from excavations – Free for use, open access. Yes, metadata can be provided under CC0.
ARUP-‐CAS – Archaeological map of the Czech Republic -‐ Metadata has been created by ourselves; we are ready to provide it to the public within the ARIADNE project under the CC0 license.
Athena-‐RC – CETI – Clay database – Not available. Currently not possible to make available under the CC0 license.
Cyi_STARC – Archaeological collections -‐ Free for use, open access. Yes, metadata can be provided under CC0.
DAI – iDAI.images, Arachne – No restrictions. Yes, metadata can be provided under CC0.
DANS eDepot for Dutch Archaeology (EDNA) – Free for use, open access. Yes, metadata can be provided under CC0.
DANS Digital Collaboratory for Cultural Dendrochronology (DCCD) -‐ Yes. The metadata which is free for use and open access is visible to any user who is not logged in.
Discovery – WODAN – As all the data within this database could be considered as metadata the same conditions apply as for the content. The database stores content form many experts and the license ultimately remains a choice of theirs. However, all experts have been advised to be open with their data and adopt CC Attribution-‐NonCommercial-‐ShareAlike 3.0 Unported.
Discovery – Mapping Death – As all the data within this database could be considered metadata the license conditions will be the same as for the content i.e. CC Attribution-‐NonCommercial-‐ShareAlike 3.0 Unported.
Discovery – Irish Stone Axe Project (ISAP) Database – As all the data within this database could be considered metadata the license conditions will be the same as for the content i.e. CC Attribution-‐NonCommercial-‐ShareAlike 3.0 Unported.
ARIADNE D3.3 (Public)
53
Discovery – SHARE-‐IT (Spatial Heritage Archaeological Research Environment) – None. Yes, metadata can be made available under the CCO license.
INRAP -‐ Archeozoom database – Part of the metadata (editorial content) is subject to copyright from the authors. Credits and Licenses must be mentioned. Geolocation and content databases can be made available to ARIADNE. Editorial content cannot be made available to ARIADNE but can be linked to Inrap’s oline platforms. No the metadata cannot be made available under the CC0 license.
INRAP – Dolia – Part of the metadata, esp. the abstract, is subject to copyright from the authors. However, part of the metadata (reduced set of metadata, related to a subset of the bibliographic records) will soon be opened through the French Open Data platform http://data.gouv.fr, under a public licence which is available in French and in English (see Appendix 5).
INRAP – Iconothèque – Images d’Archéologie (IDA) – Part of the metadata (editorial content) is subject to copyright from the authors. Credits and Licenses must be mentioned. No. Editorial content cannot be made available to ARIADNE but can be linked to Inrap’s oline platforms.
MIBAC-‐ICCU: SITAR -‐ Work in progress, available in middle term and expected under CC-‐0 license for SITAR primary.
MIBAC-‐ICCU: CulturaItalia – Yes, the metadata can be made available under the CC-‐0 license.
MNM-‐NOK – Archaeological excavation datasets -‐ In Hungary archaeologists are given five years to assess their excavated materials. Therefore, the archaeological dataset could be made public only when the five years are over for a given excavation. After that the data’s right belong to the project partner. Datasets of sites and scientific analyses belong to the project partner without time restriction. Yes, metadata can be made available under the CC0 license.
NIAM-‐BAS -‐Archaeological Map of Bulgaria -‐ It must be determined. It will be limited for some elements.
OEAW -‐ Franzhausen_Kokoron – Online version: no restrictions.
OEAW -‐ UK_Material-‐POOL – no restrictions.
SND -‐ GIS data – Free for use, open access. Yes, the metadata can be made available under the CC0 license.
ZRC SAZU – ARKAS and ZBIVA – Our current answer is no to the CC0 license. We had in mind: CC Noncommercial, Share Alike (CC BY-‐NC-‐SA 2.5 SI). That being said, we don't have any law experts in our team and are struggling a bit on the issue. In other words, if it turns out that (i) most of ARIADNE members are willing/capable to share under CC 1.0. or (ii) that this is a must for ARIADNE's success, we will reconsider our decision.
ARIADNE D3.3 (Public)
54
Appendix 2: DANS Licence Agreement and help text
By signing the agreement, you (the Depositor) undertake to comply with the provisions of the Licence Agreement concluded with DANS (the Repository).
DANS requires all users to register when they first apply.
The depositor’s details, which you provided when you registered with the online archiving system EASY, are stated at the beginning of the agreement. It is important that the DANS licence agreement is concluded with the person/organisation that holds the rights to the dataset. The name of the person or organisation must be stated. If more than one person or organisation holds the rights to the dataset, all the persons/organisations concerned must be party to the agreement.
Who holds the rights to the dataset? An explanation is provided here:
A dataset may be protected by copyright. This is the case if all or part of a dataset is protected by copyright because it contains texts produced by others, for example. In such cases, the person holding the rights to the dataset is the person ‘according to whose design and under whose supervision the work was produced’.
Datasets that are not protected by copyright may still fall within the scope of the Databases Act (Databankenwet). This is the case if a dataset is an original creation containing information that has been structured and/or combined and/or processed by the researcher in a manner that involved a ‘substantial investment’ of time, money or human resources.
Part or all of the data may be protected by copyright, or it may be the case that there is no (or no longer) copyright on the data, as is the case with government information from public archives, for example. In the latter case (no copyright on the dataset), the person who holds the rights to the dataset is ‘the producer of the database”, i.e. the person/organisation that bore the risk of the investment in the database.
If the depositor does not hold the rights to the dataset (according to the Copyright Act or the Databases Act), the dataset must be deposited in the name of the person/organisation that holds the rights. If the dataset is deposited in the name of an organisation, the name of the organisation must be specified under ‘Name’, together with the name and position of the person representing the organisation.
If you are depositing a dataset in your own name, and not on behalf of an organisation, you should specify ‘Private’ under ‘Organisation’.
ARIADNE D3.3 (Public)
55
Explanation of Articles:
Article 1
DANS requests you to grant a ‘non-‐exclusive licence’. This means that you deposit your dataset with DANS, but you are entirely free to deposit and/or grant access to your dataset elsewhere.
You give DANS permission to make your dataset available, via EASY, to registered DANS users in accordance with the agreed access category (see below). Users may copy the dataset for their own personal use only, and must not use it for commercial purposes, or sell/resell it.
Article 2
DANS assumes that you are authorised to control and manage the dataset(s) you deposit. This means that no part of your dataset must be protected by the copyright of others. This may be the case, for example, if your file contains a photograph that is protected by copyright owned by someone else. DANS accepts no liability for damage, losses or legal action arising in this regard; you will remain responsible for this. If you do not own the copyright to your dataset, DANS asks you, in order to avoid difficulties, to contact in advance any other persons who hold rights to the dataset.
Article 3
DANS will make every effort to archive your data in a sustainable form, i.e. to ensure that the data remain accessible and readable in the long term. In doing so, DANS will preserve the dataset(s) in its/their original software format as far as possible. However, if this is not possible for technical or financial reasons, DANS will be obliged to modify the form and/or functionality of the dataset.
Article 4
You remain responsible for the content of your dataset. This means that the content of your dataset must correspond to the specification you provide, and must not include any data that are contrary to Dutch law.
Article 5
In principle, your dataset cannot be removed from the archive once it has been deposited, even if it is later discovered to contain errors. This is important when users refer to your dataset in a publication. Please note that it is always possible to deposit new, updated versions of a dataset. DANS will not remove a dataset from the archive unless there are serious grounds for doing so.
Article 6
If your dataset contains personal data, it must not, under any circumstances, be made available under the access category “Open Access”. Please see below. ‘Personal data’ are data that relate to identify or identifiable natural persons. Obviously, DANS is required to comply with the provisions of the Personal
ARIADNE D3.3 (Public)
56
Data Protection Act (Wet Bescherming Persoonsgegevens). If your dataset contains personal data, you should contact DANS before depositing it. Personal data may be consulted only for the purpose of scientific, scholarly, statistical or historical research. DANS requires data users to comply with the Code of Practice for the use of personal data in scientific and scholarly research (Gedragscode voor gebruik van persoonsgegevens in wetenschappelijk onderzoek) published by the VSNU (Association of Universities in the Netherlands).
Metadata (descriptions of datasets) are always freely accessible. If the documentation you provide with your dataset includes documents that are (still) confidential, you must inform DANS of this. These documents will not be made available to users.
Article 7
When registered data users reproduce data from your dataset in a publication, they are required to include an acknowledgement containing certain basic details relating to the dataset (see Conditions of Use, Article 1), i.e. the repository (DANS), the identifier, name of the creator, title of the dataset and the date on which it was created.
Article 8
If the depositor dies, or the depositor’s organisation ceases to exist, the relevant datasets will no longer be made available under access category “Restricted Access” but will automatically be transferred to access category “Open Access”, as specified at the end of the agreement, irrespective of previous arrangements. This is not applicable if the dataset contains personal data (see article 6).
Article 10
A dataset may be regarded as a database as referred to in the Dutch Databases Act (Databankenwet). Under that Act, the rights of the producer of a database lapse automatically after a period of fifteen years, which commences on 1 January of the year following the year in which the database was completed. When the Licence Agreement ends, DANS is still authorised to make the dataset available to others.
Notes on DANS access categories
The access category is one of the most important aspects of the Licence Agreement with DANS. By means of these categories, you specify to whom you wish to make your data available. Access conditions may vary between the different parts (files) of the dataset.
DANS advocates the Open Access movement. This means that DANS promotes the free availability of research data and publications. However, there may be justified reasons for not making research data freely available, or for not doing so until a later date. This may be because they contain personal data, or there may be a temporary restriction because of a thesis/publication that is about to be completed.
ARIADNE D3.3 (Public)
57
There may also be contractual obligations with third parties. DANS therefore allows data to be made available with Restricted Access.
Open Access
If you wish to allow Open Access to your research data, you opt for this access category. This means that any user has immediate and unrestricted access to your dataset via EASY. Each time a dataset is downloaded, the user's details are recorded. This information is accessible to all users via EASY.
Restricted Access
Select this category if you wish to restrict access to your data on a permanent or temporary basis. Your dataset will be made available only to the persons and/or organisations you specify. You will receive an e-‐mail from DANS with the name of the person who wishes to access your data. You can then grant or deny access. See also Article 8.
Other access: no access via EASY
This is a special access category. It is only for datasets that will not be made available via the DANS website. Please select this category if your datasets are made available elsewhere (e.g. by you personally), but you still wish to archive them with DANS with a view to preserving them for the long term. If you wish to use this access category, please consult DANS first in all cases.
Temporary restriction: Embargo
Regardless which access category you have chosen you have always the possibility to make your dataset temporarily unavailable for anyone. This embargo period cannot be longer than two years and cannot be extended. When this period elapses.
Data Archiving and Networked Services (DANS) Version 3.3 UK, Sept. 2011 Anna van Saksenlaan 10, 2593 HT The Hague P.O. Box 93067, 2509 AB The Hague T +31 (0)70 3446 484, F +31 (0)70 3446 482, E [email protected] DANS is an institute of KNAW and NOW
ARIADNE D3.3 (Public)
58
Appendix 3: The Terms of Use and Access to ADS Resources
Online: http://archaeologydataservice.ac.uk/advice/termsOfUseAndAccess
Unless a form of Creative Commons licence is clearly attached to a particular collection and the Creative Commons logo is prominently displayed on that collection's introduction page the following terms of use and access apply.
The terms of use and access to ADS resources are defined in two documents:
• Copyright and Liability Statement • Common Access Agreement
Copyright and Liability Statements
Preamble
Archaeological data are the product of many years of scholarship by numerous archaeologists, collectors, analysts, antiquarians, field workers and laboratory scientists. It is not possible to credit all these scholars individually, or sometimes even identify them, but in making their data available for re-‐use, the ADS acknowledges their contribution. At all times, the ADS seeks to protect the intellectual property rights and copyright of the originators of data where that can reasonably be achieved. The catalogue also includes links to other service providers. It is the responsibility of users to acknowledge and comply with the copyright conditions that may be imposed by other service providers.
Questions about copyright should be addressed to the Archaeology Data Service, King's Manor, The University of York, York YO1 7EP, UK. See also the ADS Common Access Agreement
Copyright Statement
All material on this web server is protected by copyright. Specific copyright holders are identified in the appropriate pages and sections. In all other instances, copyright is retained by The University of York on behalf of the Archaeology Data Service (ADS). Links to materials on other servers should not be construed as a claim over them. All rights reserved.
A non-‐exclusive, non-‐transferable licence is hereby granted to those using or reproducing, in whole or in part, the material for valid not-‐for-‐profit teaching and research purposes, providing the copyright owners are acknowledged. The Archaeology Data Service should be cited as the source of the material from this server. Where specific permission to use material is required, this is identified and such permission must be sought from the copyright holder or agency cited. Anyone wishing to use the catalogue for any other purpose must contact the ADS to seek permission.
ARIADNE D3.3 (Public)
59
Liability Statement
The Archaeology Data Service (ADS) catalogue of collections is a catalogue, and as such, the maintainers of the catalogue explicitly disclaim to the extent permitted by law any responsibility for the accuracy, content, or availability of Information located through use of the catalogue, or for any damage incurred owing to use of the information contained therein.
Information located through use of the catalogue may be subject to specific use constraints, details of which will be made accessible to potential users by the catalogue. It is the responsibility of potential and actual users to be aware of such constraints and to abide by them.
By making use of material on this web server, including the contents of the catalogue, you accept these copyright and disclaimer provisions.
The Archaeology Data Service (ADS) was founded by a consortium comprising the Council for British Archaeology with the Universities of Birmingham, Bradford, Glasgow, Kent at Canterbury, Leicester, Newcastle, Oxford and York. The ADS is guided by an management committee consisting of representatives from all sectors of the discipline. The ADS is based at the University of York.
The Archaeology Data Service is part funded by the Arts and Humanities Research Council.
Common Access Agreement
Preamble
The Archaeology Data Service (ADS) levies no charge to use its collections when used according to the terms detailed below. However, by accepting the terms of the Common Access Agreement users are entering into a legally binding agreement. The Access Agreement covers a range of services, including both on-‐line and paper transactions, this preamble seeks to explain how it is implemented by the ADS (the ADS also formerly hosted AHDS Archaeology). NB: this preamble is an explanatory aid, not a legal instrument. Only the "Terms" below are legally binding.
It is the aim of the ADS to provide integrated access to a variety of data sets at no cost to end users. This aim can only be achieved if the organisations which supply data can be assured that their own rights are protected. Consequently, use of the catalogue is covered by a variety of legal instruments to protect us, our users, and those who supply data to us.
The Common Access Agreement asks that users be fair and reasonable in their use of the data supplied through the ADS. The ADS levies no charges, there are no documents to sign, and none will be applied retrospectively. We do not hold information on specific users. In return we ask that you acknowledge the source and copyright of the data you use, that you tell us of any errors you find in it, and that if you undertake any work based substantially upon it, that you tell us about it and send us a copy of any subsequent publication. The data must not be sold or supplied to a third party.
The data should only be used for teaching, learning, and research purposes. By teaching, we mean directed teaching undertaken with a designated tutor in a formal setting. By learning we mean self-‐
ARIADNE D3.3 (Public)
60
directed study, whether or not attached to an educational institution, including the educational activities undertaken in museums, libraries and cognate institutions. By research we mean any work undertaken for the advancement of archaeological knowledge and/or the understanding of the historic environment. Such work may be commercially sponsored or it may be funded by academic bodies or learned societies, or it may be unsupported: but it is a condition of use that the results are placed in the public domain and are made freely available for others to use according to the normal principles of professional and academic practice.
Users should also read the ADS Copyright and Liability Statement .
Terms
Common Access Agreement for Individuals
By using the ADS catalogue you are bound by the following Terms & Conditions. Please read them carefully. I have read and I understand the following statement: All material supplied via the ADS is protected by copyright and other intellectual property rights, and duplication or sale of all or part of any of the Data Collections is not permitted, except that material may be duplicated by you for your research use or educational purposes in electronic or print form. You must obtain permission for any other use. Electronic or print copies may not be offered, whether for sale or otherwise to anyone who is not an authorised user.
In addition, with reference to any data materials supplied to me by the ADS, I hereby undertake:
1. To use and to make personal copies of any part of the Data Collections only for the purposes of non-‐commercial research or teaching, as specified in the accompanying application.
2. To give access to the Data Collections, in whole or in part, or any material derived from it, only to registered users who have received permission from the ADS to use it, with the exception of data supplied for the stated purpose of teaching.
3. To give access to data supplied for the stated purpose of teaching only to students who have signed a Students' Undertaking Form, or have signed an equivalent institutional document.
4. To abide by any conditions, displayed on screen or otherwise notified to me, that may apply to the access to, or use of, specific datasets within the Data Collections.
5. To preserve at all times the confidentiality of information pertaining to identifiable individuals that is recorded in the Data Collections. In addition, where so requested, to preserve the confidentiality of information about, or supplied by, organisations recorded in the Data Collections. In particular I undertake not to use or attempt to use the Data Collections to deliberately compromise or otherwise infringe the confidentiality of individuals or organisations.
6. To meet any charges levied by the ADS for the supply of the Data Collections including, where relevant, annual service fees.
I understand that breach of the above provisions of this Agreement will lead to immediate termination
ARIADNE D3.3 (Public)
61
of my access to all ADS services, either permanently or temporarily, at the discretion of the ADS, and may result in legal action being taken against me. I further agree:
1. To acknowledge, in any publication, whether printed, electronic or broadcast, based wholly or in part on the Data Collections, the original depositors, the funders of the Data Collections (if different) and the ADS supplying the data materials, as described in the documentation accompanying the Data Collections.
2. To declare, in any publication, whether printed, electronic or broadcast, based wholly or in part on the Data Collections, that those who carried out the original collection of the data bear no responsibility for the further analysis or interpretation of it.
3. To supply the ADS with one copy of any published work based wholly or substantially on the Data Collections or alternatively with references and supporting information if this is requested.
4. That the ADS may hold the personal data submitted below for validation, statistical and billing purposes, and may pass the information on to other parties such as depositors of material contained in the ADS, and copyright and other intellectual property right owners whose material is contained in the ADS. I may request copies of data referring to me held by the ADS, and the names of the parties to whom it has been disclosed, on payment of an administrative fee of ten pounds.
5. To notify the ADS of any errors discovered in the Data Collections.
6. To accept that a service fee or other charges for provision of the Data Collections or any part thereof may be charged, and that charging for access to the Data Collections, or any part thereof, may begin or be varied, 30 days after details of the proposed changes are made available to registered site licence holders and users of the Data Collections.
7. To accept that the ADS, and the depositor of any part of the Data Collections supplied bear no legal responsibility for their accuracy or comprehensiveness, and that the ADS and the depositor of any part of the Data Collections supplied accept no liability for indirect, consequential, or incidental, damages or losses arising from use of the Data Collections, or from the unavailability of, or break in access to, the service, for whatever reason.
8. That where there is no breach of this agreement, it may be terminated, or its terms altered, by the ADS either after 30 days notice; or, if a service charge has been paid in advance, at the end of the period for which payment has been made, whichever is the longer.
ARIADNE D3.3 (Public)
62
Appendix 4: Accessibility levels at SND
Online: http://snd.gu.se/en/search-‐and-‐order-‐data/accessibility-‐levels
For access levels 1 and 2 an agreement is required between SND and the principal investigator.
Level 1 Data may be accessed without impending contact between the principal investigator and SND.
1 a Freely accessed for download without registration The material is free and accessible for all to download. Example: Swedish election manifesto and party programme.
1 b Freely accessed via ordering or Open Data (registration) To access SND's Open Data an account with email registration is required. In Open Data all studies are accessible for download and subject to certain online analysis. 1b material may also be ordered via an order form.
1 c Freely accessed via ordering Place an order via the order form by stating name, email address, point of use and so on. The researcher will not be contacted in conjunction with each order but is able to get statistics on the number of delivered orders.
Level 2 -‐ Data may be gained access to under the condition that the principal investigator is contacted at set occasions/always
2 a Requires permission from the principal investigator at set occasions Place an order via the order form by stating name, email address, point of use and so on. SND contacts the principal investigator at set occasions (for example orders by students) for approval based on the submitted order form. The user must agree to a contract in order to gain access to the material.
2 b Always require permission from the principal investigator Place an order via the order form by stating name, email address, point of use and so on. SND always contacts the principal investigator for approval based on the submitted order form. The user must agree a contract in order to gain access to the material.
Level 3 -‐ Applies to studies where the research material is described only with meta data and are not deposited at SND. For these levels no contract between the researcher or SND is required.
3 a The study is not available through SND. It can be downloaded from an external website A description of the study is visible and searchable at SND. The research material is available through the project's website/database.
ARIADNE D3.3 (Public)
63
3 b The study is not available through SND. Please contact the principal investigator for more information A description of the study is visible and searchable via SND. The study is not accessible through the Internet. To gain access or find additional information, contact the principal investigator.
ARIADNE D3.3 (Public)
64
Appendix 5: Data.Gouv.FR – Open Licence
OPEN LICENCE You may re-‐use the « Information » made available by the « Producer » under the freedoms and the conditions specified by this licence. Re-‐use of Information under this licence The « Producer » grants the « Re-‐user » a worldwide, perpetual, free of charge, non-‐exclusive, personal right to use the « Information » subject to this licence under the freedoms and the conditions set out below. You are free to re-‐use the « Information »:
• To reproduce, copy, publish and transmit the « Information » ;
• To disseminate and redistribute the « Information » ;
• To adapt, modify, transform and extract from the « Information », for instance to build upon it in order to create « Derivative information » ;
• To exploit the « Information » commercially, for example, by combining it with other
« Information », or by including it in your own product or application. You must, where you do any of the above:
1. Attribute the « Information » by acknowledging its source (at least the name of the « Producer ») and the date on which it was last updated.
The « Re-‐user » may fulfil this condition by providing one or more hypertext links (URL) referring to the « Information » and effectively acknowledging its source.
This attribution shall not suggest any official status or endorsement, by the « Producer » or any other public entity, of the « Re-‐user » or the re-‐use of the « Information ».
http://wiki.data.gouv.fr/wiki/Licence_Ouverte_/_Open_Licence