Metadata for Digital Objects October 4, 2004 Creation Metadata
Jan 12, 2016
Review class metadata sets: What categories do they fill? Dublin Core EAD FRBR INDECS MARC/MODS METS MIX/NISO MPEG21
NLM Journal OCLC/RLG ONIX PRISM RKMS TEI Lite VRA Core
What happens at file creation?
Consciously– You place into the file everything you think it
needs in order to be useful– You save it with a file name that will help you find
it again (you hope) Unconsciously
– Metadata is added using environmental information
– Metadata is added using information elicited from you
Viewing Word Metadata in XML
Add relevant metadata to your Word document as outlined above
If you have Word 2003, save the document as XML
Open the document in an XML editor If not, save the document as HTML View the document in Notepad or another
ASCII editor or view source from the HTML document displayed in a browser
The future of XML in Word?
Word already provided XML markup of its Document Properties and Custom Document Properties metadata; in Word 2003 a native (and patented) XML schema is used.
Several vendors made plugins for making older Word documents into XML documents:– eXportXML from Schultz: a template installed into Word
using a macro– Xfinity Author Wx from B-Bop: makes Word into an XML
editor for ordinary documents– WorX SE from Xyenterprise: Word as XML editor or
creator of XML objects
Future of XML at Microsoft
XML and the whole model of the way the web works are becoming part of the emerging Microsoft operating environment from servers to desktop
Note already a move to a standard IE interface for system functions
Presently provides tools to programmers under “Information Bridge” framework to allow connecting XML documents created by Microsoft programs via metadata elements to web services
Creation metadata thus vital to this whole scheme
[non-Microsoft] Uses of creation metadata
Establishing prior art for an invention Identifying who knew what and when Showing how an object fits into the larger
scheme of things (preserving the “archival bond”)
Keeping track of versions of an object Providing assurance of reliability: that the
object is what it purports to be Anchoring the object in the place and time of its
origin
Placement of creation metadata
Same options as for all metadata Embedded within the object (Word metadata) Wrapped around the object (object is
embedded in metadata document: Word document containing metadata embedded in XML document extracting reliability metadata)
Captured, communicated, or kept separately from the object (non-text objects but not only them)
UBC Creation Metadata I
A word on diplomatics The notion of a complete record
– Medium– Content– Form– Persons (author, writer, addressee, creator)– Acts– Archival bond– Transmission (intent, capability, success)
UBC Creation Metadata II
“Elements of intellectual form” inside the record:– Date (time of transmission and receipt; place of
transmission)– Superscription or attestation (author/originator)– Inscription (all addressees and receivers)– Title and/or subject– Disposition/purpose (the intention of the record)
UBC Creation Metadata III
The notion of reliable record; must in addition to completeness have:
“Document profile” as container for the object– Date available (created or received)– Time available (created or received)– [Date and time of further transmission]– Author– Addressee– Subject [classification code, registry number]
DoD 5015.2 Recordkeeping Standard: Assumptions Note that 5015.2 assumes an entire
detailed recordkeeping system that fully accounts for all records at the series, folder, and individual level
The “file plan” defines the recordkeeping system; the “schedule” is applied to entities defined in the file plan
DoD 5015.2 Recordkeeping Standard: record metadata (*=mandatory) Unique identifier* Supplemental marking
list Subject/Title* Media type* Format* Date filed* Publication date* Date received
Author/Originator* Addressee* Other addressees* Originating
organization* Location Vital record indicator Vital record
review/update cycle* User-defined fields
DoD 5015.2 email metadata mapping Sender (Author/Originator) Primary addressees (Addressee) Other addressees (Other addressee) Date/time sent (Publication date) Date/time received (Date received) Subject (Subject/Title)
Date/time and persons: vital to records’ reliability Without date/time, not possible to manage
records by date: cutoffs, retention, destruction Without persons (author, recipient, creator),
nobody would care Without hierarchical set of data categories, no
structure Note dependence on systems in which
records creation are embedded
What about non-text objects?
Creation metadata for non-text objects covers much the same ground:– Information about occasion of creation– Information about creator, intention, receiver– Information about the object itself
Many kinds of non-text objects:– Images, still and moving– Sound– Multimedia
Connecting metadata to a non-text object Object is kept in specifically-defined file structure File name/ID is crucial to the connection XLink is used to connect the two using a series
of XML attributes:<graphicxmlns:xlink=“http://www.w3.org/1999/xlink”xlink:type=“simple”xlink:href=“usr/local/coll1/object9.tif”xlink:show=“new”xlink:actuate=“onRequest”/>