OAI-2003 1 Science & Culture Developing a Knowledge Site in Distributed Information Environments OAI-Forum Bath, September 5th, 2003 Ann Borda Alpay Beler Nick Wyatt Science Museum London UK
Mar 27, 2015
OAI-2003 1
Science & CultureDeveloping a Knowledge Site in Distributed
Information Environments
OAI-ForumBath, September 5th, 2003
Ann BordaAlpay BelerNick Wyatt
Science MuseumLondon UK
OAI-2003 2
Science & Culture 1 Large scale internet project funded by the New Opportunities
Fund (Lottery funding) National Museum of Science & Industry (NMSI)
– the Science Museum (London)– Science and Society Picture Library (London)– National Railway Museum (York)– National Museum of Photography, Film & Television
(Bradford) Audience – life long learners
OAI-2003 3
Science & Culture 2 Aim of the website:
– to make a rich quantity of material & collections accessible, – to contextualise through intelligent display, searching
(resource discovery) and relational linking. – to develop user-focused activities and personalisation tools
that are supported by these resources
OAI-2003 4
Science & Culture 3
Sourced content:– 40,000 digitised images and accompanying text
records– 30,000 library records– 10,000 object records – 50 narrative topics
OAI-2003 5
Issues Different types/functions of legacy systems
Different data standards and platforms in use Diffuse coordination of 'repurposable' content Mix of non-networked and networked systems Varied database connectivity Costs and time in upgrading and configuration
OAI-2003 6
Source Systems OverviewLocation Source
Database Function Type of System
National Railway Museum (York)
iBase Image management Microsoft Data Engine (MSDE) v. 7.
National Museum of Photography, Film & Television (Bradford)
iBase Image management Microsoft Data Engine (MSDE) v. 7.
Science Museum – Science and Society Picture Library (London)
Capture Picture library database File Maker Pro + C++
Science Museum (London)
MultiMIMSY 2000
Collections management Oracle 8.0 running on a Microsoft Windows 2000 server.
Science Museum Library (London)
Unicorn Library management C/ISAM (Informix) with BRS-Search running on SunOS5.7 (Solaris 7).
OAI-2003 7
Primary Records
Library (Unicorn) – AACR2, Marc21Object (MultiMimsy) – SpectrumImage records
– Capture (SSPL) – local guidelines– Ibase (NRM) – local guidelines– Ibase (NMPFT) – local guidelines
Images
OAI-2003 8
Cataloguing
Agreed definitions of DC elements Agreed list of qualifiers Core fields for export NMSI-wide image cataloguing
guidelines Procedures for cataloguing objects Cataloguing standards document
OAI-2003 9
Authority Control
Importance Authority fields People – authorised name, normalised name, dates,
biography etc. Authority lists for
– people – organisations – places – events/periods
On-going process
OAI-2003 10
Data Mapping
Agreed mappings to DC elements Need for concatenation, e.g. names Matching with authority files Linking to other record types Mapping table Need for vigilance & checking
OAI-2003 11
‘Interim’ Database The logical design would focus on:
Simplicity and efficiency of retrieval of informationOptimisation and consistency of information
NOT a cataloguing database a collection point for data DC fields as primary data structure enable data ‘normalisation’ export as DC fields in XML wrapper end-purpose to populate the web CMS
OAI-2003 12
Export Formats
Location Source Database Export
National Railway Museum(York)
iBase XML
National Museum ofPhotography, Film &Television (Bradford)
iBase XML
Science Museum – Scienceand Society Picture Library(London)
Capture CSV
Science Museum (London) MultiMIMSY 2000 XMLScience Museum Library(London)
Unicorn Tab delimited
OAI-2003 13
XML RECORD- <dcschema xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="D:\NOF\dcschema.xsd">- <dc_record> <dc_identifier>1980-108</dc_identifier>- <dc_creator1> <name>Coster, Salomon, d. 1659</name> <role>maker</role> </dc_creator1> <dc_relation relationType="Object">T/1980-108</dc_relation> <dc_relation relationType="Image">TIM100282</dc_relation> <dc_relation relationType="Image">10326522</dc_relation> <dc_subject subjectType="Controlled">pendulum clock</dc_subject> <dc_type>physical object | text</dc_type> <dc_format>text | xml</dc_format> <dc_title>’Haagsche Klokje’, pendulum clock, c. 1657.</dc_title> <dc_description>This clock by Salomon Coster (d. 1659) of the Netherlands is one of theearliest pendulum clocks ever made. The Dutch scientist Christiaan Huygens (1629-1693)designed the first successful pendulum clock in 1656. He worked with Coster, anexperienced clockmaker, to apply his new invention to commercial use. In 1657 Huygenshad the patent protecting his invention assigned to Coster, but sadly Coster died suddenlyin 1659 after producing only a few pendulum clocks. This is one of only seven pendulumclocks made by Coster known to have survived. The application of the pendulum clock totimekeeping during the scientific revolution of the 17
th century was arguably the most
fundamental advance in the history of time measurement.</dc_description> <dc_date_created>c. 1657</dc_date_created> <dc_subject_broad2>Time Measurement</dc_subject_broad2> <dc_cover_spatial>World > Europe > Netherlands > The Hague</dc_cover_spatial> <dc_language>en</dc_language> <audit_ok_for_nof>true</audit_ok_for_nof> </dc_record>
OAI-2003 14
Image management 3 image management systems need to reference image files with the text records text records to retain ‘mapped’ reference to image
files upon export ability to process TIFF images to JPEGs for web
Solution: Interim database back-end is an iBase system to
handle image relationships & processes
OAI-2003 158
Metadata & resource discovery
Common metadata set
Common syntax
Consistency of content
Dublin Core
XML/DOM/Common validation
Rules for contentCommon approachCataloguing rules
OAI-2003 16
Digitisation Infrastructure
OAI-2003 17
Web Content Management System (CMS)
COM database MS based architecture & server platform specified to handle XML and DC elements ability to handle metadata from the interim database. ability to manage relationships
ability to handle image references and image files
OAI-2003 18
Ingenious Site
Subject relatedResources
Subject relatedDebate
T OPIC 1See separate
structure
T OPIC 2
SUBJECT S(Editorial)
READ
UserRegistration
DebateDebate List
DEBAT E
Im agesLibrary records
ObjectsResources
SEARCH(Subject
Categories)
SEE
UserRegistration
M y Lightbox M y Gallery M y Links
CREATE
HOM E
OAI-2003 19
Topic Structure
Core Topic Text
Activity
“Relevance Toolbar”
Biography
Voices
Unusual Takes
Glossary
Debate
Related Images
Related Stories
Related Library
Records
OAI-2003 20
Collection Level Description
DC & RSLP CLD elements used Allows consistency for searching Overall CLD for whole site Subject Level CLD Topic Level & Topic Part CLD
OAI-2003 21
Site CLD<META NAME="DC.Title" CONTENT="Ingenious">
<META NAME="DC.Description" CONTENT="This site makes connections between people, innovations and ideas. It contains images and other resources illustrating human endeavour and development from the Science Museum, National Railway Museum and National Museum of Photography, Film & Television. Subjects and topics put these images in context, giving historical and cultural insights on current issues in science, technology and medicine.">
<META NAME="DC.Rights" CONTENT="http://www.sciencemuseum.org.uk/copyright/copyright.asp">
<META NAME="DC.Creator" CONTENT="Science Museum | National Railway Museum | National Museum of Photography, Film & Television">
<META NAME="DC.Publisher" CONTENT="National Museum of Science & Industry">
<META NAME="DC.Language" CONTENT="en-uk">
<META NAME="DC.Type.category" CONTENT="collection">
<META NAME="DC.Format" CONTENT="text/html">
<META NAME="DC.Date.created" CONTENT="2003”>
<META NAME="DC.Identifier" CONTENT="http://193.71.79.113/">
<META NAME="DC.Subject.LCSH" CONTENT="Culture | Science | Technology | Medicine | Photography | Transport | Railroads | Industries | History">
<META NAME="DC.Relation.HasPart" CONTENT="sacsub01 | sacsub02 | sacsub03 | sacsub04 | sacsub05 | sacsub06 | sacsub07 | sacsub08 | sacsub09 | sacsub10 | sacsub11 | sacsub12">
OAI-2003 22
SEARCH - Subject Metadata
OAI-2003 23
Topic metadata<!-- DC begin --><META NAME="DC.Title" CONTENT="Home and away; ">
<META NAME="DC.Identifier" CONTENT="sacsub04; ">
<META NAME="DC.Description" CONTENT=" ; ">
<META NAME="DC.Format" CONTENT="text/html">
<META NAME="Robots" CONTENT="all">
<META NAME="DC.Language" CONTENT="en-uk">
<META NAME="DC.Type.category" CONTENT="collection">
<META NAME="DC.Creator" CONTENT="; ">
<META NAME="DC.Date.Created" CONTENT="2003">
<META NAME="CLD:hasLocation" CONTENT="; ">
<META NAME="DC:Subject.SAC" CONTENT="Home and away; ">
<META NAME="DC:Subject.Broad2" CONTENT="Cinematography & Film | Entertainment | Photography | Photography: Equipment | Radio | Sound Reproduction & Acoustics | Television | Computing & Data Processing | Clothing | Domestic Life & Household Management | Firemaking | Lighting | Social & Economic Life | Sports & Pastimes | Building Construction & Architecture | Civil Engineering | Firefighting | Plastics ; ">
<META NAME="DC:Subject.Keywords" CONTENT="; ">
<META NAME="DC.Rights" CONTENT="http://www.sciencemuseum.org.uk/copyright/copyright.asp">
<META NAME="CLD:Owner" CONTENT="National Museum of Science & Industry">
<META NAME="DCQ:hasPublication" CONTENT="sactopic19 | sactopic20 | sactopic21 | sactopic22; ">
<META NAME="DCQ:isPartOf" CONTENT="Ingenious; ">
<META NAME="CLD:isDescriptionOf" CONTENT="NMSI Digitised resources; "><!-- DC end -->
OAI-2003 24
Sustainability - Present Conform to future standards (e.g. Dublin Core) Repurposable data in an XML wrapper: ‘Create
once and use many’ Customisable according to needs Ability to share data across organisations Multiple platform delivery: different channels Community building
OAI-2003 25
Science & Culture - Repurposing
OAI-2003 26
Future - Semantic Web
Search Engines
Knowledge ManagementSystems & Applications
HTML WebPages
Agents
MetadataDublin CoreRDF SchemaXML
Resource DiscoveryServices
Information Retrieval
Taxonomies
OAI-2003 27
Future – Creating Communities Interest groups linked to subject hierarchies Groups generate knowledge Knowledge added to existing “formal data” Taps into “informal” knowledge
– unwritten– oral– practical
The nature of knowledge on the site changes
OAI-2003 28
Presented by
Ann Borda, NOF Project [email protected]
Alpay Beler, NOF IS [email protected]
Nick Wyatt, NOF Metadata [email protected]
Science Museum, London