Top Banner
Metadata Metadata Mark-up and Management olf Knoll, National Library of the Czech Republic
19

Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

Jan 20, 2016

Download

Documents

Claude Campbell
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

Metadata

Metadata Mark-up and Management

© Adolf Knoll, National Library of the Czech Republic

Page 2: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

Metadata

It is added value to digital files for which it forms a container to identify them to enable easier access and navigation to control the entire compound document to enable archival storage to enable research work and publication of even

critical editions, etc.

Page 3: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

Compound Document

The document consisting of interconnected metadata and data files the metadata are added descriptions (mostly

pieces of text) the data are any external files produced by

digitizing pieces of original documents (images, texts, sound files, even video files)

Page 4: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

What is described?

OBJECTS- of which the document consists and which

build the document

- which have their unchanging substance

- whose representations can vary in their different occurrences

- which can have some important additional characteristics

Page 5: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

Object OISEAU

BIRDPTÁK

VOGEL

CockKohoutHahn

EagleOrelAdler

PenguinTučňákPinguin

FalconSokolFalke

DuckKachnaEnte

Page 6: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

Objects

They are defined by the creator or interpreter of the document

They can be built from any sequence or amount of bits in metadata or data areas

It should be established: which types of objects must be distinguished how they should be marked

Page 7: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

Object OISEAU

We have decided to have such an object (animal with wings, feathers, laying eggs)

We have decided to mark anything having these characteristics as OISEAU

We know that this object has different names in different languages (bird, pták, Vogel, птица, pasăre, …)

We know that in reality only concrete birds appear (duck, cock, falcon, penguin, eagle, …)

Page 8: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

Objects and contents

Semantically poor content

• formal object (paragraph, heading, note, …)

• used for formatting• languages built on

these objects are used for output (HTML, MS WORD, …)

• PRESCRIPTIVE MARK-UP

Semantically rich content

• content oriented object (author, flower, house, …)

• used for understanding• languages built on

these objects are used for description (MARC, TEI, EAD, DOBM, …)

• DESCRIPTIVE MARK-UP

Page 9: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

SGMLStandard Generalized Markup Language

• a general language to mark objects• to be applied, it needs to become more concrete

(this is made via DTD)• thus, second level applications can be written• these applications are used directly or they require

additional definitions (DTDs)• SGML applications: HTML, XML, TEI, …

Page 10: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

DESIGNING OUR PROJECT

Page 11: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

What do we need?

Open communication Internal precision and cohesion of markup Multiple output, reuse of marked data, liberty

to add new marked data Complex document control and

management Open and flexible content-oriented

description principle

Page 12: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

What do we work with?

For a manuscript having 300 pages, we work with: more than 1500 digital data files produced through

digitization (Gallery, Preview, Internet, User, Excellent quality levels: 300x5 + images for covers, end-sheets, ...)

more than 300 description metadata files (each digitized piece of the original + files for bibliographic and technical descriptions + technological files)

This means that the above mentioned requirements must be applied to a complex document consisting of hundreds of computer files, which play various roles.

Page 13: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

Independency

Metadata should be independent of display – pure values

We must know: which features of objects to describe – we need

DESCRIPTION RULES how to mark up these objects – we need RULES for

MARK-UP how to formalize which objects and how will be described

– DTD how to display the compound document – we need rules

for display (transformation rules) If the platform is SGML or XML, we write DTD and

XSL tools.

type of document; place

Page 14: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

place of publishing; publisher; date; addressee

Page 15: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

description elements

author type of document: postcard place: Hronov place of publishing: Hronov publisher: Karel Šefelín date: 1914 addressee: František Bittnar annotation: Streets of Hronov in 1914; postcard written by my

great-grandmother to her husband making military service

However, maybe there are better rules, e.g. AACR2 defining how to describe a postcard – we should take them or some approach largely applied than this proposal of ours.

Page 16: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

how to mark the elements?

In DTD: <!ELEMENT PlaceOfPublication

(#PCDATA)>

In Metadata File: <PlaceOfPublication>Hronov</

PlaceOfPublication>

Page 17: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

write

<Author></Author> <TypeOfDocument>postcard</TypeOfDocument> <Place>Hronov</Place> <PlaceOfPublication>Hronov</PlaceOfPublication> <Publisher>Karel Šefelín</Publisher> <Date>1914<Date> <Addressee>František Bittnar<Addressee> <Annotation>Streets of Hronov in 1914; postcard

written by my great-grandmother to her husband making military service</Annotation>

Page 18: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

publish

XSL transformation of the XML files … in order to display them

Index by a database tool and provide even a better access

Link metadata with image data

This is work for professionals

Page 19: Metadata Metadata Mark-up and Management © Adolf Knoll, National Library of the Czech Republic.

tools

Simple browsing Internet access tools