Top Banner
Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice' IST- 2001- 320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Open Archives Forum - Technical Validation - Birgit Matthaei Humboldt University Berlin, Germany Computer and Media Service, Electronic Publishing Group [email protected]
19

IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Mar 27, 2015

Download

Documents

Wyatt Roche
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Open Archives Forum- Technical Validation -

Birgit Matthaei

Humboldt University Berlin, Germany

Computer and Media Service, Electronic Publishing Group

[email protected]

Page 2: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Creating Information Sources

European portal for open archives information • Information Resource Database (registry of repositories,

services, software, projects and associated organisations

Evaluation of status, experiences and future plans regarding European OAI implementations

• Online “Technical Validation Questionnaire” Systematic inventories

• Repositories, services and tools

Reports on own experiences• OAI-PMH 2.0 alpha and beta tester, • Implementation of OAI Services • experiences software tools

Highlight some aspects of european activities on OAI in relation to worldwide activities

Page 3: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Overview Repositories - World

.

Upgrading to OAI 2.0 (Nov. 2002)

OAI 2.040%

OAI 1.160%

Upgrading to OAI 2.0 (Aug. 2003)

OAI 1.117%

OAI 2.083%

9382

6 5 4 1

0

20

40

60

80

100

Overview on OAI activity (continents)

America (North)

Europe

Australia

America (Middle & South)

Asia

Africa

Page 4: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Overview Repositories - Europe

.

22

18

11

7

32

1

0

5

10

15

20

25

Overview of european countries engaged in OAI implementation UKGermanyFranceItalySwedenAustriaNetherlandsBelgiumBelorussiaDenmarkFinlandIrelandNorwayPortugalRussiaSloveniaSpainSwitzerland

Page 5: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Details of Questionnaire

.

Data Provider

22

10

19

0 5 10 15 20 25

planned

in development

active

Service Provider

16

11

5

0 5 10 15 20 25

planned

in development

active

Page 6: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Questionnaire: Community Types

.

no specification

13%Library

31%

Archive19%

Museum8%

Publisher2%

Preprint/Science

13%

Others14%

Multiple answers possible

Page 7: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Questionnaire: Object Types

.

Multiple answers possible

23

17

8

6

5

1

1

1

2

15

14

9

12

1

3

1

1

0 5 10 15 20 25 30 35 40

Metadata

Fulltext documents

Abstracts

Images - digitised mat.

Images - Vector graphics

Video/Streams

Software

Audio

Raw/Statistic Data

Others

already OAI compatible not yet OAI compatible

Page 8: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Questionnaire: Content Types

.

Multiple answers possible

14

13

9

6

6

2

15

8

8

8

6

4

2

11

0 5 10 15 20 25 30

Dissertations

Journal Articles

Preprints

Conference Proceedings

Lectures

Recordings

Others

already OAI compatible not yet OAI compatible

Page 9: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Questionnaire: Used Software

Many different tools were mentioned: ADLIB, ARNO, CDSware, DIENST, DSpace, Elektra, EPrints, PERL implementations, OAI Cat, OAI Harvester, VT-ETB-db

Today: 50 % use self developeded toolsOne year ago: 80 % used self-developed tools

Trend: Need of tools which are user-friendly complete solutions

cover typical functionalitiesto be installed by relatively small expenditureadaptable to special requirements if necessarylittle expenditure with the further care of the data

Page 10: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Tools: Eprints - DSpace

GNU Eprints - developed at the Electronics and Computer Science Department of the University of Southampton, UK

DSpace - newly developed as a joint project of the MIT Libraries and the HP Company, USA

Some numbers of the inventory of repositories:

Nov 2002 other tools74% eprints

26%

Aug 2003

eprints39%

other tools61%

Page 11: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Tools: Eprints - DSpace

Open source developments for archiving

Nearly identically in their functionalitysearch functions, document archiving, online interfaces for self archiving, integration of the OAI PMH, …

Systems base on different technologiesEprints: traditional technologies, runs on pure open source systems: mySQL and Apache, programmed by using the script language “Perl”

Dspace: operates with new technologies such as the Postgres database and Tomcat for jsp/java web application, higher performance

Page 12: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Questionnaire: Metadata Formats

.

Multiple answers possible

17

10

4

3

1

1

1

12

7

5

2

2

3

2

2

3

6

0 5 10 15 20 25

Dublic Core simple

Dublin Core qualified

MARC 21

UNIMARC

EAD

MAB

TEI

METS

Others (single mentioned)

already OAI compatible not yet OAI compatible

Page 13: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Problems for Service Provider

Problem: Standardisation“heterogeneity of the content of the metadata records requires the service provider to expend a lot of effort in normalizing the data in order to make it more comparable and usable”

could be done at lesser cost by the individual DP

or by the development of middleware tools that service providers could use for data normalisation

Lacking interoperabilitydifferent metadata standards, terminology, languages, access strategies, interfaces / transfer protocols, copyright regulations

Difficulty to establish joint services based on open archives

Page 14: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Creating Recommendations

Example of a possible solution:German Initiative for Networked Information:

Recommendations for usage of OAI-PMH created by DINI-OAI working group (http://www.dini.de/) target: agreement on syntax and semantics of OAI set

definitions for German data and service providers enhance retrieval quality and support subject gateways

(e.g. Physnet, Dissertation search engine, ...) definition of three classification types

subjects (according to DNB)formal publication types (e.g. dissertation)formal document types (e.g. text, audio)

example service provider based on recommended sets: http://edoc.hu-berlin.de/e_suche/oai.php

Page 15: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

OAI advantages ?

Importance of OAI - provide additional services to existing services- replace existing services through OAI interface- better retrieval, make Metadata exchange available

Advantages of OAI- share scientific knowledge, harvest other knowledge

databases, cross-search in institutional assets- major dissemination of researchers' results- simple and cheap in implementation

„provide access to all of human knowledge“ „nothing other than political expediency“

Page 16: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

web

database

http://www.oaforum.org/oaf_db/

Page 17: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

web

questionnaire

http://www.oaforum.org/resources/tecvalq2.php

Page 18: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

web

project documents

http://www.oaforum.org/documents/

Page 19: IST- 2001-320015 Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,

Birgit Matthaei, 4th Sept. 2003, Bath, 4th OAForum Workshop: 'In Practice, Good Practice'

IST- 2001-320015

Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group

Thank you!

Birgit Matthaei

Humboldt University Berlin, Germany

Computer and Media Service, Electronic Publishing Group

[email protected]