etb.eun.org

12.03.2001 Kluck (HUB/IZ)

ETB IST 1999 - 11781

IuK 2001 Metadata + Heterogeneity in ETB

Metadata and Handling of Heterogeneity as Central Means for the Development of an European School Portal - The Project European Schools Treasury Browser – ETB

Presentation at the 7th Annual Meeting of the IuK Initiative, Trier 11.-14.03.2001

Michael Kluck

Humboldt University Berlin, Computer Uses in Education (HUB)

Social Sciences Information Centre Bonn (IZ)

Introduction (I)

The ETB project is embedded in the context of the European Schoolnet (EUN) www.eun.org

The European Schoolnet is the new framework for the co-operation between the European Ministries of Education on Information and Communication Technology in Education.

EUN builds a European network of national and regional computer networks of repositories on schools.

BUILD THE “SCHOOLNET INFORMATION SPACE”

Introduction (II)

ETB works out the technological and structural prerequisites for this network of networks.

Building on a preceding project, ETB shall realise the technical infrastructure and the content-based integration of the different services and of their cultural and linguistic contexts.

The presentation concentrates on the content integration of the participating networks and repositories.

The main user groups will be teachers and pupils.

Developing a Common Metadata Set

Context and general purpose:

- Get similarly structured information
- Facilitate targeted search
- Avoid mismatch between the specific search and the unstructured universe of the Internet:
  - Topic versus person (e.g. Ohm, Kierkegaard)
  - Different domain-specific meanings (e.g. Leistung, Disziplin)
  - Domain-specific meaning versus general meaning (e.g. Lehre, services)

ETB Metadata

Derived from the Dublin Core metadata elements and the EUN Metadata Element Set (developed in the preceding EUN project)

Quite minimalised, but with obligation types:

- M = mandatory
- O = optional

Using RDF syntax
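
To give an impression of what a record in RDF syntax could look like, the following sketch serialises a few Dublin Core elements as RDF/XML with Python's standard library. The slides do not show the actual ETB schema or namespace, so only the public RDF and Dublin Core namespaces are used; the resource URL, the field values and the helper name `to_rdf_xml` are hypothetical.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"

# Use the conventional prefixes instead of auto-generated ns0/ns1.
ET.register_namespace("rdf", RDF_NS)
ET.register_namespace("dc", DC_NS)


def to_rdf_xml(resource_url, fields):
    """Serialise a flat dict of Dublin Core elements as an RDF/XML string."""
    root = ET.Element(f"{{{RDF_NS}}}RDF")
    description = ET.SubElement(
        root, f"{{{RDF_NS}}}Description", {f"{{{RDF_NS}}}about": resource_url}
    )
    for name, value in fields.items():
        # Dublin Core element names are lower-case in the XML binding.
        ET.SubElement(description, f"{{{DC_NS}}}{name.lower()}").text = value
    return ET.tostring(root, encoding="unicode")


# Hypothetical record, for illustration only.
example = to_rdf_xml(
    "http://example.org/resource/1",
    {"Title": "Ohm's law worksheet", "Creator": "A. Teacher", "Language": "en"},
)
print(example)
```

ETB-specific elements such as EUN User Level would need their own namespace, which is not given in the presentation and is therefore left out here.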

ETB Metadata Elements (I)

- Title: M
- Creator: M
- Subject: O or M?!
- Description: M
- Publisher: O
- Contributor: O
- Date: O
- Type: O

ETB Metadata Elements (II)

- Format: O
- Identifier: M
- Source: O
- Language: M
- Relation: O
- Coverage: O
- Rights Management: O
- Audience: O
- EUN User Level: O
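
The obligation types of the elements listed on these two slides lend themselves to a simple completeness check. The sketch below is one possible way to express it, not the project's implementation. "Subject" is marked "O or M?!" in the source, so treating it as optional is an assumption; the function name `check_record` and the sample record are hypothetical.

```python
# Obligation types as listed on the slides: M = mandatory, O = optional.
# "Subject" is undecided in the source ("O or M?!"); optional is assumed here.
OBLIGATIONS = {
    "Title": "M", "Creator": "M", "Subject": "O", "Description": "M",
    "Publisher": "O", "Contributor": "O", "Date": "O", "Type": "O",
    "Format": "O", "Identifier": "M", "Source": "O", "Language": "M",
    "Relation": "O", "Coverage": "O", "Rights Management": "O",
    "Audience": "O", "EUN User Level": "O",
}


def check_record(record):
    """Return (missing mandatory elements, unknown elements) for a record."""
    missing = sorted(
        name for name, obligation in OBLIGATIONS.items()
        if obligation == "M" and not record.get(name)
    )
    unknown = sorted(name for name in record if name not in OBLIGATIONS)
    return missing, unknown


# Hypothetical, deliberately incomplete record.
record = {
    "Title": "Ohm's law worksheet",
    "Creator": "A. Teacher",
    "Language": "en",
    "Keywords": "physics",
}
print(check_record(record))
```

For this record the check reports Description and Identifier as missing and flags "Keywords" as a field that is not part of the element set.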

ETB Metadata Elements (III)

Element Subject

- Besides freely chosen keywords
- ETB thesaurus terms
- Sound or video clip representing the content of an audio, audiovisual, visual or multimedia resource

ETB Metadata Elements (IV)

Element EUN User Level: school level or age group

- Pre-school (education)
- Primary (education)
- AdultEducation
- Secondary (education)
- Vocational (education and training)
- HigherEducation
- Juvenile (material for children and adolescents in general)
- Adult (material for adults in general)

Producing Metadata

- Direct entry by authors (adapting given rules/definitions or using an online template)
- Generation by repositories during input
- Extraction from existing un-coded data by defining extraction rules
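
As a rough illustration of the third route, extraction by rules, the sketch below pulls a few fields out of the structured parts of an HTML page using Python's standard `html.parser`. The rule table `META_RULES`, the class name and the sample page are invented for this example; the presentation does not specify which rules ETB actually defined.

```python
from html.parser import HTMLParser

# Hypothetical extraction rules: HTML <meta name="..."> -> ETB element.
META_RULES = {
    "author": "Creator",
    "description": "Description",
    "keywords": "Subject",
}


class MetadataExtractor(HTMLParser):
    """Collect ETB-style fields from structured parts of an HTML page."""

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "html" and attrs.get("lang"):
            self.fields["Language"] = attrs["lang"]
        elif tag == "meta":
            element = META_RULES.get((attrs.get("name") or "").lower())
            if element and attrs.get("content"):
                self.fields[element] = attrs["content"]

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title and data.strip():
            self.fields["Title"] = data.strip()


def extract(html_text):
    """Apply the extraction rules to one page and return the fields found."""
    parser = MetadataExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.fields


# Invented sample page.
page = (
    '<html lang="de"><head><title>Das Ohmsche Gesetz</title>'
    '<meta name="author" content="A. Lehrer"></head><body></body></html>'
)
print(extract(page))
```

Fields that cannot be detected simply stay absent, so a record produced this way would still have to be checked against the mandatory elements.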

Metadata Extraction and Mapping

For repositories with differing metadata structures, mapping schemes into the ETB Metadata Element Set will be set up.

For repositories without metadata schemes, metadata will be extracted from the entries, as far as structured elements of the resources can be detected and an algorithm for converting them into metadata fields can be applied.
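
A mapping scheme of the kind described for repositories with their own metadata structure can be pictured as a per-repository lookup table. The following is a minimal sketch under that assumption; the repository names, local field names and the function `map_record` are hypothetical.

```python
# Hypothetical mapping schemes: local field name -> ETB element name.
MAPPINGS = {
    "repo_a": {"titel": "Title", "autor": "Creator",
               "sprache": "Language", "schlagwort": "Subject"},
    "repo_b": {"name": "Title", "author": "Creator",
               "lang": "Language", "url": "Identifier"},
}


def map_record(repository, local_record):
    """Translate a local record into ETB elements; keep unmapped fields apart."""
    scheme = MAPPINGS[repository]
    mapped, unmapped = {}, {}
    for field, value in local_record.items():
        target = scheme.get(field)
        if target is None:
            unmapped[field] = value  # no rule: left for manual review
        else:
            mapped[target] = value
    return mapped, unmapped


# Invented local record from "repo_a".
mapped, unmapped = map_record(
    "repo_a",
    {"titel": "Das Ohmsche Gesetz", "autor": "A. Lehrer", "klasse": "9"},
)
print(mapped)
print(unmapped)
```

Keeping unmapped fields separate, rather than dropping them, makes gaps in a mapping scheme visible.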

Metadata Exchange via NNTP

Technical Goals of ETB

A new approach for a European Network of repositories

- Network based on “Publish” not “Pull”
- Added value to users from a thesaurus
- Retain full local editorial policy
- High quality control tools
- Wider outreach
- Support of multilinguality

ETB Thesaurus (I)

Search problems

- Natural language problems: synonymy, homonymy, polysemy, phrases, compounds, spelling variations
- Lack of relevance control
- Multilinguality

ETB Thesaurus (II)

Thesaurus benefits

- Effective control of indexing language (preferred terms, inter-language equivalence)
- Systematic display of descriptors (ease of navigation through the terminology)
- Indexing and searching by using post-coordination
- Following recommendations of Dublin Core
- Basics for solving heterogeneity
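
Two of these benefits, vocabulary control through preferred terms and post-coordinated searching, can be shown with a toy example. This is only a sketch: the terms, documents and function names are invented, and the real ETB thesaurus is far richer than a flat lookup table.

```python
# Toy thesaurus: every entry term (synonym or other-language equivalent)
# points to one preferred descriptor. All terms and documents are invented.
ENTRY_TERMS = {
    "physics": "physics",
    "physik": "physics",
    "natural philosophy": "physics",
    "primary school": "primary school",
    "grundschule": "primary school",
    "elementary school": "primary school",
}

# Documents are indexed with single descriptors only.
INDEX = {
    "doc1": {"physics"},
    "doc2": {"physics", "primary school"},
    "doc3": {"primary school"},
}


def to_descriptor(term):
    """Map any entry term to its preferred descriptor (None if not covered)."""
    return ENTRY_TERMS.get(term.strip().lower())


def search(*terms):
    """Combine descriptors at search time (post-coordination, logical AND)."""
    wanted = {to_descriptor(term) for term in terms}
    if None in wanted:
        return []  # at least one term is not covered by the thesaurus
    return sorted(doc for doc, descriptors in INDEX.items() if wanted <= descriptors)


print(search("Elementary school", "Physik"))
```

Because indexing uses single descriptors, a combined concept such as "physics in primary school" needs no descriptor of its own; the combination happens only in the query.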

ETB Thesaurus (III)

The content of the repositories in the EUN context (multimedia material, teaching material, school projects), with schools as the target area and teachers and pupils as the main target groups, needs specific terminology.

Only a few repositories have developed their own terminology.

Handling Heterogeneity (I)

Making use of existing content descriptions

Dealing with heterogeneity on the content level means:

The same words or phrases may indicate different meanings in different environments (e.g. education, or class):

- Occurring anywhere in the full text of an Internet resource
- Being the code of a classification scheme assigned to a document
- Being an indexing term taken from a specific thesaurus

Handling Heterogeneity (II)

Use of existing intellectual work done by the different repositories or resource authors: indexing or classifying documents even with different schemes or terminologies

Use of existing terminologies or classification schemes for automatic processing of transfer relations

Handling Heterogeneity (III)

Methods for solving heterogeneity problems

- Intellectual building of cross-concordances between relevant terminologies and classification schemes and between different languages, and automatic (statistical) building of transfer components
- Developing transfer components between those terminologies and schemes, and between those and the words occurring in the full texts (co-occurrence analysis, fuzzy methods, neural networks etc.)
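
To make the idea of a statistical transfer component more concrete, the sketch below uses plain document-level co-occurrence counts to estimate how strongly a full-text word points to a thesaurus descriptor. It is a deliberately simple stand-in for the methods named above, not the procedure used in ETB; the training data, the threshold and the function names are all invented.

```python
from collections import Counter, defaultdict

# Tiny invented training set: (words in the text, descriptors assigned by an indexer).
TRAINING = [
    ({"voltage", "current", "resistance"}, {"electricity"}),
    ({"voltage", "circuit"}, {"electricity"}),
    ({"current", "river", "flow"}, {"geography"}),
    ({"circuit", "resistance"}, {"electricity"}),
]


def build_transfer(training):
    """Estimate P(descriptor | word) from document-level co-occurrence counts."""
    word_count = Counter()
    pair_count = defaultdict(Counter)
    for words, descriptors in training:
        for word in words:
            word_count[word] += 1
            for descriptor in descriptors:
                pair_count[word][descriptor] += 1
    return {
        word: {d: n / word_count[word] for d, n in pairs.items()}
        for word, pairs in pair_count.items()
    }


def transfer(word, table, threshold=0.6):
    """Return descriptors whose estimated probability reaches the threshold."""
    scores = table.get(word, {})
    return sorted(d for d, p in scores.items() if p >= threshold)


table = build_transfer(TRAINING)
print(transfer("voltage", table))
print(transfer("current", table))
```

In this toy data "voltage" always co-occurs with the descriptor "electricity" and is transferred to it, while the ambiguous word "current" is split evenly between two descriptors and stays below the threshold.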

Multilingual Access

Using ETB thesaurus and heterogeneity handling

ETB thesaurus allows indexing or searching in any covered language and results can automatically be retrieved in all other languages.

Heterogeneity handling (intellectually or automatically processed) allows the use of any (language specific) scheme: results can also be retrieved in other schemes or languages.

Integration of results in the area of cross-language information retrieval and its evaluation (see: CLEF = Cross-Language Evaluation Forum at www.clef-campaign.org )
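
The first point, searching in one language and retrieving results in the others, can be sketched with a concept-based lookup. The concept identifiers, terms, records and the function `cross_language_search` below are invented for illustration and say nothing about how the ETB thesaurus is actually stored.

```python
# Invented multilingual thesaurus: concept id -> preferred term per language.
CONCEPTS = {
    "c1": {"en": "primary school", "de": "Grundschule", "fr": "école primaire"},
    "c2": {"en": "physics", "de": "Physik", "fr": "physique"},
}

# Invented records, each indexed with concept ids and carrying its own language.
RECORDS = [
    {"id": "r1", "lang": "de", "concepts": {"c1"}},
    {"id": "r2", "lang": "fr", "concepts": {"c1", "c2"}},
    {"id": "r3", "lang": "en", "concepts": {"c2"}},
]

# Reverse lookup: any term in any covered language -> concept id.
LOOKUP = {
    term.lower(): concept_id
    for concept_id, terms in CONCEPTS.items()
    for term in terms.values()
}


def cross_language_search(term):
    """Find records in every language that share the concept behind `term`."""
    concept_id = LOOKUP.get(term.lower())
    if concept_id is None:
        return []
    return [
        (record["id"], record["lang"], CONCEPTS[concept_id][record["lang"]])
        for record in RECORDS
        if concept_id in record["concepts"]
    ]


print(cross_language_search("Grundschule"))
```

A German query term thus also reaches the French record, and each hit can be shown with the descriptor in the record's own language.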

Conclusion

ETB is strongly integrated in an existing and rapidly developing application for practitioners (teachers and pupils) with good political support for handling ICT in education.

ETB is strongly integrated into top level research on distributed networking, metadata, (cross-language) information retrieval, multilingual thesauri, and heterogeneity handling.

Thank you for your attention!

Further information

On the multilingual ETB thesaurus

http://www.en.eun.org/eun.org2/eun/en/etb/content_frame.cfm?lang=en&ov=3813

On other aspects of the ETB Project (collection description, quality management, technical solutions)

http://www.en.eun.org/eun.org2/eun/en/etb/sub_area_frame.cfm?sa=195&row=1

Michael Kluck's publications: http://www.educat.hu-berlin.de/~kluck/kl-personal.html
