Metadata: The TEI Header and Manuscript Description
TEI@Oxford
2010-07
Oxford TEI Summer School 2010
The TEI Header
The TEI header was designed with two goals in mind:
needs of bibliographers and librarians trying to document ‘electronic books’
needs of text analysts trying to document ‘coding practices’ within digital resources
The result is that discussion of the header tends to be pulled in two directions...
The Librarian’s Header
Conforms to standard bibliographic model, using similar terminology
Organized as a single source of information for bibliographic description of a digital resource, with established mappings to other such records (e.g. MARC)
Emerging code of best practice in its use, endorsed by major digital collections
Pressure for greater and more exact constraints to improve precision of description: preference for structured data over loose prose
Everyman’s Header
Gives a polite nod to common bibliographic practice, but has a far wider scope
Supports a (potentially) huge range of very miscellaneous information, organized in fairly ad hoc ways
Many different codes of practice in different user communities
Unpredictable combinations of narrowly encoded documentation systems and loose prose descriptions
TEI Header Structure
The TEI header has four main components:
<fileDesc> (file description) contains a full bibliographic description of an electronic file.
<encodingDesc> (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived.
<profileDesc> (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting (just about everything not covered in the other header elements).
<revisionDesc> (revision description) summarizes the revision history for a file.
Only <fileDesc> is required; the others are optional.
Example Header: Minimal required header
A title?
Who published?
Where from?
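A minimal sketch of such a header, assuming that the three questions above correspond to the three mandatory children of <fileDesc>; that mapping is an assumption, and the question text simply stands in for real content.

```xml
<teiHeader xmlns="http://www.tei-c.org/ns/1.0">
  <fileDesc>
    <!-- "A title?" : the title statement -->
    <titleStmt>
      <title>A title?</title>
    </titleStmt>
    <!-- "Who published?" : the publication statement -->
    <publicationStmt>
      <p>Who published?</p>
    </publicationStmt>
    <!-- "Where from?" : the source description -->
    <sourceDesc>
      <p>Where from?</p>
    </sourceDesc>
  </fileDesc>
  <!-- encodingDesc, profileDesc and revisionDesc are optional and omitted here -->
</teiHeader>
```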
The TEI supports two ‘levels’ or types of header:
corpus level: metadata sets default properties for everything in a corpus
text level: metadata sets specific properties for one component text of a corpus
Corpus Header Example
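A sketch of how the two levels might nest, assuming the standard <teiCorpus> wrapper with one corpus-level header followed by texts that each carry their own header. All titles and prose here are hypothetical placeholders.

```xml
<teiCorpus xmlns="http://www.tei-c.org/ns/1.0">
  <!-- corpus-level header: defaults for every text in the corpus -->
  <teiHeader>
    <fileDesc>
      <titleStmt><title>Hypothetical corpus title</title></titleStmt>
      <publicationStmt><p>Hypothetical publication details</p></publicationStmt>
      <sourceDesc><p>Hypothetical note on the sources of the corpus</p></sourceDesc>
    </fileDesc>
  </teiHeader>
  <TEI>
    <!-- text-level header: properties specific to this one text -->
    <teiHeader>
      <fileDesc>
        <titleStmt><title>Hypothetical title of one component text</title></titleStmt>
        <publicationStmt><p>Hypothetical publication details</p></publicationStmt>
        <sourceDesc><p>Hypothetical note on the source of this text</p></sourceDesc>
      </fileDesc>
    </teiHeader>
    <text>
      <body><p>Hypothetical text content</p></body>
    </text>
  </TEI>
</teiCorpus>
```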
Types of content in the TEI header
free prose:
  prose description: series of paragraphs
  phrase: character data, interspersed with phrase-level elements, but not paragraphs
grouping elements: specialised elements recording some structured information
declarations: Elements whose names end with the suffix Decl (e.g. subjectDecl, refsDecl) enclose information about specific encoding practices applied in the electronic text.
descriptions: Elements whose names end with the suffix Desc (e.g. <settingDesc>, <projectDesc>) contain a prose description, possibly, but not necessarily, organised under some specific headings by suggested sub-elements.
File Description
<fileDesc> has some mandatory parts:
<titleStmt>: provides a title for the resource and any associated statements of responsibility
<sourceDesc>: documents the sources from which the encoded text derives (if any)
<publicationStmt>: documents how the encoded text is published or distributed
and some optional ones:
<editionStmt>: yes, electronic texts have editions too
<seriesStmt>: and they also fit into "series"
<extent>: how many floppy disks, gigabits, files?
<notesStmt>: notes of various types
The File Description
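<titleStmt>: contains a mandatory <title> which identifies the electronic file (not its source!)
optionally followed by additional titles, and by ‘statements of responsibility’, as appropriate, using <author>, <editor>, <sponsor>, <funder>, or the generic <respStmt>
<publicationStmt>: may contain
plain text (e.g. to say the text is unpublished)
one or more <publisher>, <distributor>, <authority>, each followed by elements such as <pubPlace>, <address>, <idno>, <availability>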
A minimal header for Punch
Punch, or the London Charivari: an electronic edition
Owen Seaman (1861-1936)
TEI version
TEI@Oxford team
Unpublished
Recoded from the Project Gutenberg versions
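One plausible TEI encoding of the Punch header content above, offered as a sketch. The element chosen for each piece of text is an assumption, in particular the use of <editor> for Owen Seaman (an <author> element would be equally possible).

```xml
<teiHeader xmlns="http://www.tei-c.org/ns/1.0">
  <fileDesc>
    <titleStmt>
      <title>Punch, or the London Charivari: an electronic edition</title>
      <!-- assumption: Seaman is recorded as editor of the source -->
      <editor>Owen Seaman (1861-1936)</editor>
      <!-- generic statement of responsibility for the encoding -->
      <respStmt>
        <resp>TEI version</resp>
        <name>TEI@Oxford team</name>
      </respStmt>
    </titleStmt>
    <publicationStmt>
      <p>Unpublished</p>
    </publicationStmt>
    <sourceDesc>
      <p>Recoded from the Project Gutenberg versions</p>
    </sourceDesc>
  </fileDesc>
</teiHeader>
```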
Title- and Responsibility- statements...
There may be many of them:
Artamene
Le Grand Cyrus
Digital Edition
Amongst the guilty parties:
Scudery, Madeleine de
Geffin, Alexandre
Fonds Nationale Suisse de la Recherche Scientifique
Encoding check
Jean Untel
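A sketch of a title statement carrying these titles and responsible parties. The @type values on <title> and the role assigned to each party (author, principal investigator, funder) are assumptions made for illustration.

```xml
<titleStmt xmlns="http://www.tei-c.org/ns/1.0">
  <!-- several titles may be given; the type values are illustrative -->
  <title type="main">Artamene</title>
  <title type="alt">Le Grand Cyrus</title>
  <title type="sub">Digital Edition</title>
  <!-- assumed roles for each responsible party -->
  <author>Scudery, Madeleine de</author>
  <principal>Geffin, Alexandre</principal>
  <funder>Fonds Nationale Suisse de la Recherche Scientifique</funder>
  <!-- anything without a dedicated element goes in a generic respStmt -->
  <respStmt>
    <resp>Encoding check</resp>
    <name>Jean Untel</name>
  </respStmt>
</titleStmt>
```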
<publicationStmt> example
TEI Consortium
Oxford Text Archive
1256
Available under the terms of a Creative Commons Attribution and Share Alike licence.
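A sketch of a structured publication statement built from the values above. Treating the TEI Consortium as publisher, the Oxford Text Archive as distributor, and 1256 as an identifier is an assumption; the @type value on <idno> is hypothetical.

```xml
<publicationStmt xmlns="http://www.tei-c.org/ns/1.0">
  <publisher>TEI Consortium</publisher>
  <distributor>Oxford Text Archive</distributor>
  <!-- hypothetical type label for the archive's identifier -->
  <idno type="archive">1256</idno>
  <availability>
    <p>Available under the terms of a Creative Commons Attribution and
      Share Alike licence.</p>
  </availability>
</publicationStmt>
```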
<notesStmt> example
<notesStmt> can contain notes on almost any aspect:
Material prepared for the TEI@Oxford Summer School.
The Source Description
All electronic works need to indicate their source, even if it is just to say that it is 'born digital'. There are a variety of ways to do this:
prose description
<bibl>: contains free text or any mixture of bibliographic elements such as <author>, <title> etc.
<biblStruct> contains effectively the same elements but constrained in various ways according to bibliographic standards
<biblFull> special-cases texts which were born TEI by replicating an embedded <fileDesc>
A <listBibl> may be used for lists of such descriptions
Specialised elements for spoken texts (<recordingStmt> etc.) and for manuscripts (<msDesc>). Discussed later!
Authority lists for e.g. people (<listPerson>) or places (<listPlace>) can be included.
<sourceDesc> examples
Born digital.
Enigma, Punch: or the London Charivari, July 1, 1914, 147, p. 6
<bibl> vs. <biblStruct> Example
Enigma, in Punch: or the London Charivari (July 1, 1914), vol 147, pp. 1-20
Enigma
Punch: or the London Charivari
London
July 1, 1914
147
1-20
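A sketch contrasting the two approaches for the same citation. The first entry keeps the prose order and punctuation inside a loose <bibl>; the second splits the same data into the fixed analytic and monographic levels of <biblStruct>. The @level and @unit attributes are assumptions about how one might tag these parts (earlier TEI P5 releases used @type rather than @unit on <biblScope>).

```xml
<listBibl xmlns="http://www.tei-c.org/ns/1.0">
  <!-- loose: free text with optional inline elements -->
  <bibl>
    <title level="a">Enigma</title>, in
    <title level="j">Punch: or the London Charivari</title>
    (<date when="1914-07-01">July 1, 1914</date>),
    vol <biblScope unit="volume">147</biblScope>,
    pp. <biblScope unit="page">1-20</biblScope>
  </bibl>
  <!-- structured: same data, constrained order and nesting -->
  <biblStruct>
    <analytic>
      <title>Enigma</title>
    </analytic>
    <monogr>
      <title>Punch: or the London Charivari</title>
      <imprint>
        <pubPlace>London</pubPlace>
        <date when="1914-07-01">July 1, 1914</date>
        <biblScope unit="volume">147</biblScope>
        <biblScope unit="page">1-20</biblScope>
      </imprint>
    </monogr>
  </biblStruct>
</listBibl>
```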
Encoding Description
<encodingDesc> groups notes about the procedures used when the text was encoded, either summarised in prose or within specific elements such as
<projectDesc>: goals of the project
<samplingDecl>: sampling principles
<editorialDecl>: editorial principles, e.g. <correction>, <normalization>, <quotation>, <hyphenation>, <segmentation>, <interpretation>
<classDecl>: classification system/s used
<tagsDecl>: specifics about usage of particular elements
The <encodingDesc> can replace the user manual, or facilitate semi-automatic document management, given agreed codes of practice.
<encodingDesc> Example (1)
The Imaginary Punch Project aims to ....
All pages containing editorial text have been transcribed in full. Pages containing only advertisements or illustrations have been omitted.
Original spelling has been retained, except that words hyphenated across line breaks have been silently re-assembled. The hyphen has been retained only where there exist cases of the same word being hyphenated in mid-line position.
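A sketch of how these three statements might be distributed across the specific elements of <encodingDesc>. Which element holds which paragraph is an assumption based on the topics each one covers.

```xml
<encodingDesc xmlns="http://www.tei-c.org/ns/1.0">
  <!-- goals of the project -->
  <projectDesc>
    <p>The Imaginary Punch Project aims to ....</p>
  </projectDesc>
  <!-- what was included and what was left out -->
  <samplingDecl>
    <p>All pages containing editorial text have been transcribed in full.
      Pages containing only advertisements or illustrations have been
      omitted.</p>
  </samplingDecl>
  <!-- editorial principles, here given as plain prose -->
  <editorialDecl>
    <p>Original spelling has been retained, except that words hyphenated
      across line breaks have been silently re-assembled. The hyphen has
      been retained only where there exist cases of the same word being
      hyphenated in mid-line position.</p>
  </editorialDecl>
</encodingDesc>
```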
<encodingDesc> Example (2)
story occupies more than half a page
story occupies between quarter and a half page
story occupies less than a quarter page
Refers to domestic political events
Refers to foreign political events
refers to role of women in society
refers to role of servants in society
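A sketch of how the categories above might be declared in a <classDecl>, assuming they form two taxonomies (one for story size, one for topic). The grouping and every xml:id value are hypothetical.

```xml
<classDecl xmlns="http://www.tei-c.org/ns/1.0">
  <!-- assumed first taxonomy: how much of the page a story occupies -->
  <taxonomy xml:id="size">
    <category xml:id="size-large">
      <catDesc>story occupies more than half a page</catDesc>
    </category>
    <category xml:id="size-medium">
      <catDesc>story occupies between quarter and a half page</catDesc>
    </category>
    <category xml:id="size-small">
      <catDesc>story occupies less than a quarter page</catDesc>
    </category>
  </taxonomy>
  <!-- assumed second taxonomy: what the story is about -->
  <taxonomy xml:id="topic">
    <category xml:id="topic-domestic">
      <catDesc>Refers to domestic political events</catDesc>
    </category>
    <category xml:id="topic-foreign">
      <catDesc>Refers to foreign political events</catDesc>
    </category>
    <category xml:id="topic-women">
      <catDesc>refers to role of women in society</catDesc>
    </category>
    <category xml:id="topic-servants">
      <catDesc>refers to role of servants in society</catDesc>
    </category>
  </taxonomy>
</classDecl>
```

A text or division could then point at one of these categories by its identifier, for example with a <catRef> whose target is "#topic-women".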
Profile Description
A collection of descriptions, categorised only as ‘non-bibliographic’. Default members of the model.profileDescPart class include:
<creation>: information about the origination of the intellectual content of the text, e.g. time and place
<langUsage>: information about languages, registers, writing systems etc. used in the text
<textDesc> and <textClass>: classifications applied to the text by means of a list of specified criteria or by means of a collection of pointers, respectively
<particDesc> and <settingDesc>: information about the ‘participants’, either real or depicted, in the text
<handNotes>: information about the hands identified in a manuscript
Language and character set usage
The <langUsage> element is provided to document usage of languages in the text. Languages are identified by their ISO codes:
English
French
Bulgarian in Cyrillic characters
Romanized Bulgarian
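A sketch of a <langUsage> declaring these four languages. The @ident values are assumptions: they follow the usual language-tag pattern of an ISO 639 language code optionally followed by an ISO 15924 script code.

```xml
<langUsage xmlns="http://www.tei-c.org/ns/1.0">
  <language ident="en">English</language>
  <language ident="fr">French</language>
  <!-- script subtags distinguish the two written forms of Bulgarian -->
  <language ident="bg-Cyrl">Bulgarian in Cyrillic characters</language>
  <language ident="bg-Latn">Romanized Bulgarian</language>
</langUsage>
```

Passages in the text can then be marked with a matching xml:lang value, such as xml:lang="fr".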
Classification Methods
<textClass> provides a classification (by domain, medium, topic...) for the whole of a text expressed in one or more of the following ways:
using <catRef>: direct reference to a locally defined (e.g. in the corpus header) category
using <classCode>: reference to some commonly agreed and externally defined category (e.g. UDC)
using <keywords>: assigns arbitrary descriptive terms taken from a bibliographic controlled vocabulary or a tag cloud
BNC Example
W nonAc: humanities arts
History, Modern - 19th century
Capitalism - History - 19th century
World, 1848-1875
This categorization applies to the whole text. For more fine-grained classification, use @decls on e.g. a <div> element.
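A sketch of a <textClass> holding the values above, assuming the first line is a classification code and the remaining three are subject keywords. Both @scheme values are hypothetical pointers to wherever the scheme is defined.

```xml
<textClass xmlns="http://www.tei-c.org/ns/1.0">
  <!-- hypothetical pointer to the genre classification scheme -->
  <classCode scheme="#hypotheticalGenreScheme">W nonAc: humanities arts</classCode>
  <!-- hypothetical pointer to the controlled subject vocabulary -->
  <keywords scheme="#hypotheticalSubjectScheme">
    <term>History, Modern - 19th century</term>
    <term>Capitalism - History - 19th century</term>
    <term>World, 1848-1875</term>
  </keywords>
</textClass>
```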
Revision Description
A list of <change> elements, each with @when and @who attributes, indicating significant stages in the evolution of a document.
Most recent first.
Can be maintained manually, but better done by means of a CMS (change management system)
$LastChangedDate: 2010-06-28 09:14:36 +0100 (Mon, 28 Jun 2010) $.
$LastChangedBy: lou $
$LastChangedRevision: 10346 $
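A sketch of a manually maintained revision description, most recent change first. The first entry reuses the date, user and revision number from the keywords above; the second entry and the "#lou" pointer form are hypothetical.

```xml
<revisionDesc xmlns="http://www.tei-c.org/ns/1.0">
  <!-- most recent change first -->
  <change when="2010-06-28" who="#lou">Revision 10346</change>
  <!-- hypothetical earlier entry, for illustration only -->
  <change when="2010-06-01" who="#lou">Hypothetical earlier revision</change>
</revisionDesc>
```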
Manuscript Description
Why are manuscripts special?
Manuscripts are unique objects, often of great cultural or political value.
Books, by contrast, exist in multiple copies, and can be described adequately by well-established and formalised bibliographic conventions.
For manuscripts, there are several traditions, often descriptive or belle lettriste, and little consensus.
Similar concerns apply to other text-bearing objects.
Objectives of <msDesc>
The TEI <msDesc> element is intended for several different kinds of applications:
standalone database of library records (finding aid)
discursive text collecting many records (catalogue raisonné)
metadata component within a digital surrogate (electronic edition)
tool for ‘quantitative codicology’
Catalogue Raisonné
An <msDesc> can appear anywhere a paragraph can.
The Arnamagnæan Manuscript Collection
The Arnamagnæan Collection is widely recognised as one of the most significant collections of early Scandinavian manuscripts in the world…
Among its more important holdings are:
In the following manuscript….
Having one's cake and eating it
Two conflicting desires:
preserve (or perpetuate) existing descriptive prose
reliable search, retrieval, and analysis of data
The <msDesc> tries, wherever possible, to do both of these things.
Components of a manuscript description
Within the <msDesc> element comes a required <msIdentifier> element, which groups information identifying the manuscript, followed by an optional <head>, which can be used to provide in a brief, unstructured way information on the manuscript's contents etc. These are then followed either by one or more paragraphs (<p>), or one or more of the following specialised elements:
<msContents>: an itemised list of the intellectual content of the manuscript, with transcriptions of rubrics, incipits, explicits etc., as well as primary bibliographic references
<physDesc>: groups information concerning all physical aspects of the manuscript, its material, size, format, script, decoration, binding, marginalia etc.
<history>: provides information on the history of the manuscript, its origin, provenance and acquisition by its holding institution
Components of a manuscript description (cont.)
<additional>: groups other information about the manuscript, in particular, administrative information relating to its availability, custodial history, surrogates etc.
<msPart>: contains in essence a nested <msDesc>, in cases of composite manuscripts now regarded as constituting a single unit but made up of two or more parts which were originally physically distinct.
Within each of these elements a number of sub-elements is available; <msContents>, for example, will normally consist of one or more <msItem> elements, each in turn containing specific elements for <rubric>, <incipit>, <explicit> and <colophon>, as well as the standard TEI elements <author>, <title> and <bibl> for bibliographic references. As with <msDesc> itself, however, the contents of these first-level and second-level elements need not be this structured, since there is also the option of using paragraphs.
Identification (1)
The <msIdentifier>
Traditional three part specification:
place (<country>, <region>, <settlement>)
repository (<institution>, <repository>)
identifier (<collection>, <idno>)
Canada
Ottawa
Library and Archives Canada
E.W.B. Morrison
MG 30 E 81 v. 16
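A sketch of the three-part identification for this example. Assigning "E.W.B. Morrison" to <collection> and the shelfmark to <idno> is an assumption based on the order of the values.

```xml
<msIdentifier xmlns="http://www.tei-c.org/ns/1.0">
  <!-- place -->
  <country>Canada</country>
  <settlement>Ottawa</settlement>
  <!-- repository -->
  <repository>Library and Archives Canada</repository>
  <!-- identifier -->
  <collection>E.W.B. Morrison</collection>
  <idno>MG 30 E 81 v. 16</idno>
</msIdentifier>
```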
Identification (2)
Alternative or additional names can also be included:
Danmark
København
Det Arnamagnæanske Institut
AM 45 fol.
Codex Frisianus
Fríssbók
Intellectual Content
May simply use paragraphs of text…
… or a tree of <msItem> elements
… optionally preceded by a prose summary
We can describe the content in general terms:
An extraordinary charivari of heroic deeds and improving tales, including an early version of Guy of Warwick and several hymns.
or we can provide detail about each distinct item:
An extraordinary charivari of heroic deeds, improving tales, and hymns.
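The <msItem> element
Manuscripts contain identifiable items, usually physically tied to a locus.
<locus>, if present, must be given first
then any of the following, in a specified order:
<author>, <respStmt>, <title>, <rubric>, <incipit>, <explicit>, <finalRubric>, <colophon>, <decoNote>, <listBibl>, <filiation>, <note>, …
… or nested <msItem>s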
<msContents> with multiple <msItem>s
fols. 5r-7v
An ABC
fols. 7v-8v
Lenvoy de Chaucer a Scogan
fols. 14r-126v
Troilus and Criseyde
Bk. 1:71-Bk. 5:1701, with additional losses due to mutilation throughout
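A sketch of these three items as <msItem> elements, each with its locus first. The @n numbering, the @from and @to values on <locus>, and the use of <note> for the remark on the third item are assumptions.

```xml
<msContents xmlns="http://www.tei-c.org/ns/1.0">
  <msItem n="1">
    <!-- from/to give the folio range in a machine-readable form -->
    <locus from="5r" to="7v">fols. 5r-7v</locus>
    <title>An ABC</title>
  </msItem>
  <msItem n="2">
    <locus from="7v" to="8v">fols. 7v-8v</locus>
    <title>Lenvoy de Chaucer a Scogan</title>
  </msItem>
  <msItem n="3">
    <locus from="14r" to="126v">fols. 14r-126v</locus>
    <title>Troilus and Criseyde</title>
    <note>Bk. 1:71-Bk. 5:1701, with additional losses due to
      mutilation throughout</note>
  </msItem>
</msContents>
```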
Physical Description
An artificial (but helpful) grouping of many distinct items.
You can simply supply paragraphs of prose, covering such topics as
<objectDesc>: the physical carrier
<handDesc>: what is carried on it
<musicNotation>, <decoDesc>, <additions>
<bindingDesc> and <sealDesc>
<accMat>: accompanying material
Or, group your discussion within the specific elements mentioned above.
Similarly, within the specific elements, you can supply paragraphs of prose, or further specific elements.
The carrier 1
The <objectDesc> can contain just paragraphs, or <supportDesc> and <layoutDesc>.
Early modern parchment and paper.
The carrier 2
A more complex substructure with specific elements for <support>, <extent>, <foliation>, <collation>, <condition>.
Multiple layouts may also be specified:
Between 25 and 32 ruled lines.
Between 34 and 50 ruled lines.
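A sketch of a <layoutDesc> recording the two layouts. The @ruledLines attribute takes a minimum and maximum value separated by a space; using it here alongside the prose is an assumption about how the ranges might be recorded.

```xml
<layoutDesc xmlns="http://www.tei-c.org/ns/1.0">
  <!-- ruledLines gives the minimum and maximum number of ruled lines -->
  <layout ruledLines="25 32">
    <p>Between 25 and 32 ruled lines.</p>
  </layout>
  <layout ruledLines="34 50">
    <p>Between 34 and 50 ruled lines.</p>
  </layout>
</layoutDesc>
```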
<handNote> and <decoNote>
<handNote> (note on hand) describes a particular style or hand distinguished within a manuscript.
<decoNote> contains a note describing either a decorative component of a manuscript or a fairly homogeneous class of such components.
<handDesc> example (1)
The manuscript is written in two contemporary hands, otherwise unknown, but clearly those of practised scribes. Hand I writes ff. 1r-22v and hand II ff. 23 and 24. Some scholars, notably Verner Dahlerup and Hreinn Benediktsson, have argued for a third hand on f. 24, but the evidence for this is insubstantial.
<handDesc> example (2)
The first part of the manuscript, fols 1v-72v:4, is written in a practised Icelandic Gothic bookhand. This hand is not found elsewhere.
The second part of the manuscript, fols 72v:4-194, is written in a hand contemporary with the first; it can also be found in a fragment of Knýtlinga saga, AM 20b II fol.
The <additions> element can be used to list or describe any additions to the manuscript, such as marginalia, scribblings, doodles, etc., which are considered to be of interest or importance.
The text of this manuscript is not interpolated with sentences from Royal decrees promulgated in 1294, 1305 and 1314. In the margins, however, another somewhat later scribe has added the relevant paragraphs of these decrees, see pp. 8, 24, 44, 47 etc.
As a humorous gesture the scribe in one opening of the manuscript, pp. 36 and 37, has prolonged the lower stems of one letter f and five letters þ and has them drizzle down the margin.
<accMat> (accompanying material) contains details of any significant additional material which may be closely associated with the manuscript being described, such as non-contemporaneous documents or fragments bound in with the manuscript at some earlier historical period.
A copy of a tax form from 1947 is included in the envelope with the letter. It is not catalogued separately.
<history>
<origin>: where it all began
<provenance>: everything in between
<acquisition>: how you acquired it
<origin> is a datable element and thus has attributes @notBefore and @notAfter, @when etc.
<history> Example
Written in England in the 13th cent.
On fol. 54v very faint is Iste liber est fratris guillelmi de buria de Roberti ordinis fratrum Predicatorum, 14th cent. (?): hanauilla is written at the foot of the page (15th cent.).
Bought from the Rev. W. D. Macray on March 17, 1863, for 1 pound 10s.
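A sketch of this history split into origin, provenance and acquisition, with normalised dates added through the datable attributes. The inline tagging (<origPlace>, <origDate>, <quote>, <name>, <date>) and the year range chosen for "13th cent." are assumptions.

```xml
<history xmlns="http://www.tei-c.org/ns/1.0">
  <!-- where it all began -->
  <origin>
    <p>Written in <origPlace>England</origPlace> in the
      <origDate notBefore="1201" notAfter="1300">13th cent.</origDate></p>
  </origin>
  <!-- everything in between -->
  <provenance>
    <p>On fol. 54v very faint is
      <quote>Iste liber est fratris guillelmi de buria de Roberti ordinis
        fratrum Predicatorum</quote>, 14th cent. (?):
      <quote>hanauilla</quote> is written at the foot of the page
      (15th cent.).</p>
  </provenance>
  <!-- how the holding institution acquired it -->
  <acquisition>
    <p>Bought from the Rev. <name type="person">W. D. Macray</name> on
      <date when="1863-03-17">March 17, 1863</date>, for 1 pound 10s.</p>
  </acquisition>
</history>
```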
<additional> information
<adminInfo>: administrative information
<surrogates>: information about other surrogates, i.e. photographs, digital images etc.
<accMat>: accompanying material
<listBibl>: bibliography
Administrative information
record history
availability
custodial history
miscellaneous remarks
Conserved between March 1961 and February 1963 at Birgitte Dalls Konserveringsværksted.
Photographed in May 1988 by AMI/FA.
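A sketch of these two events recorded as custodial history inside <adminInfo>. The @type labels and the normalised date ranges are assumptions derived from the prose.

```xml
<adminInfo xmlns="http://www.tei-c.org/ns/1.0">
  <custodialHist>
    <!-- one custEvent per event in the custodial history -->
    <custEvent type="conservation" notBefore="1961-03" notAfter="1963-02">
      <p>Conserved between March 1961 and February 1963 at Birgitte
        Dalls Konserveringsværksted.</p>
    </custEvent>
    <custEvent type="photography" notBefore="1988-05-01" notAfter="1988-05-31">
      <p>Photographed in May 1988 by AMI/FA.</p>
    </custEvent>
  </custodialHist>
</adminInfo>
```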
And finally
A <msDesc> can contain <msPart>, essentially a nested <msDesc>, where originally distinct manuscripts or parts of manuscripts have been brought together to form a composite manuscript.
Amiens
Bibliothèque Municipale
MS 3
Maurdramnus Bible
MS 6
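A sketch of a composite manuscript with one nested part. Treating "MS 6" as the identifier of a part is an assumption; the part is identified here with its own <msIdentifier>, whereas earlier TEI P5 releases used <altIdentifier> inside <msPart>.

```xml
<msDesc xmlns="http://www.tei-c.org/ns/1.0">
  <msIdentifier>
    <settlement>Amiens</settlement>
    <repository>Bibliothèque Municipale</repository>
    <idno>MS 3</idno>
    <msName>Maurdramnus Bible</msName>
  </msIdentifier>
  <!-- a formerly separate unit, described as a nested part -->
  <msPart>
    <msIdentifier>
      <idno>MS 6</idno>
    </msIdentifier>
  </msPart>
</msDesc>
```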
Conclusions
The TEI header was originally conceived as something for non-specialist usage but has everything needed for rigorous bibliographic description
It provides detailed methods for encoding specialist items such as manuscript descriptions or details concerning spoken texts or linguistic corpora
Standard codes of practice or ways of using it have been developed by particular user communities (e.g. digital librarians, corpus linguists)
As a ‘primary source of information’ it remains an essential framework for documenting:
what your text is
where it came from
how you encoded it
how it may be used (technically)
how it may be used (legally)