Top Banner
10 november 2004 Metadata and XML 1 InfoDesign: Understanding by Design Peter J. Bogaards www.bogieland.com Metadata and XML Improving the Findability of Information Peter J. Bogaards (BogieLand.com) Information Designer & Information Architect “Sharing knowledge is better than having it.” EIDC 2004 - Wiesbaden 10 november 2004
34

Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

Aug 27, 2018

Download

Documents

buihanh
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 1

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Metadata and XMLImproving the Findability of Information

Peter J. Bogaards (BogieLand.com)Information Designer & Information Architect“Sharing knowledge is better than having it.”

EIDC 2004 - Wiesbaden10 november 2004

Page 2: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 2

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Introduction

• Background in instructional design. (1987)

• Design of (tech) facilities to enhance human learningprocesses.

• Interface, document and information designer.(@Informaat ‘90-‘97).

• W3: Electronic documentation and user interfacedesign merger.

• Information designer and information architect.(@Razorfish EU 2000-2003).

• BogieLand (2003): Information design & informationarchitecture consultancy.

• InfoDesign: Understanding by Design (>1997). informationdesign.org

Page 3: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 3

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Agenda

• Purpose: To paint the landscape

• Findability of information

• XML and metadata

• Subject-based classification:Controlled vocabularies, Thesaurus, Taxonomy, and Ontology

• Faceted classification (XFML)

• Technologies: Topic maps and RDF

• A vision for the future

• ?&!

Page 4: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 4

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Findability of InformationFinding anyone or anything fromanywhere at anytime

Page 5: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 5

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Findability of information

• A wealth of information = a poverty of attention

• Structure versus chaos

• Information architecture: How to organize information in order to letpeople find things?

• Applying concepts, methods and techniques from Library andInformation Science

• How to improve information retrieval?

• Documents are for humans, data is for machines

Page 6: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 6

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

XMLeXtensible Markup Language

Page 7: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 7

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

XML: eXtensible Markup Language

• SGML -> HTML/XHTML -> XML

• A language for making <tag> sets.

• Meaningful tags for search and information retrieval.

• Machine understandable information.

• Document structure: XML schema

• Document content: XML name spaces

• Document presentation: XSL(T), SVG et al.

Page 8: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 8

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

MetadataData about data

Page 9: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 9

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Metadata: Data about Data, not Code

• Information about objects on subjects - metadata describes objects.

• Purposes: Information management and discovery.

• Metadata enables content to be retreived, tracked, and assembledautomatically.

• Metadata is machine understandable information about (web)resources and is the foundation of all information retrieval.

• Metadata is any statement about an information resource.

• Metadata is a writing skill.

Page 10: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 10

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Email document: Attribute value pairs

From: Peter J. Bogaards ([email protected])

To: Michael Fritz ([email protected])

Date: Nov. 10, 2004

Hi Michael,

How are you?

Best,

Peter

Page 11: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 11

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Dublin Core Metadata Initiative: 15 Elements

• Title, Subject/Keywords*, Description

• Creator, Publisher, Contributor

• Date, Type, Format

• Relation, Coverage, Rights

• Source, Language, Identifier

<meta name="DC.Identifier" scheme="URI"content="http://www.informationdesign.org" />

*Meaning in the SUBJECT/KEYWORDS tag, other tags are fordocument management.

See also: dublincore.org

Page 12: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 12

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Controlled vocabulariesOrganizing words and phrases

Page 13: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 13

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

CVs: Organized Words and Phrases

• “… organized lists of words and phrases (…) that are used to intiallytag content, and than to find it through navigation or search.” (AmyWarner)

• No CV: multiple terms for identical concepts -> chaos

• Closed list of named subjects, which can be used for classification.

• Creating a common language between user and system.

• A type of metadata that functions as a subset of natural language.

Page 14: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 14

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

TaxonomyCarl Linnaeus Goes Digital

Page 15: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 15

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Taxonomy: Carl Linnaeus (1700’s) Goes Digital

• A taxonomy is a complex CV

• One type of relation between terms: broader/narrower term in thehierarchy.

• A subject-based classification that arranges the terms in the CVinto a hierarchy.

Page 16: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 16

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

ThesaurusBT/NT, RT, SN, and USE/UF

Page 17: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 17

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Thesaurus: BT/NT, USE/UF, SN and RT

• Extend taxonomies to describe the world better.

• ISO standard 2788 - Properties:

– BT: Broader term - one level up in the hierarchy

– NT: Narrow term / Inversed with BT

– SN: Scope note (Explanation of meaning of the term)

– RT: Related term (No synonym or BT/NT: ‘See also’)

– USE: Other term preferred/synonym /Inversed with UF

• To provide a much richer vocabulary for describing the terms thantaxonomies do.

Page 18: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 18

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Thesaurus: Example (Karl Fast et al.)

Jeans

• BT Pants

• NT Levis

• NT Wranglers

• UF Dungarees

• UF Waist Overalls

• RT Denim

• RT Overalls

Denim

• BT Fabrics

• NT Ring Spun

• NT Dark Indigo

• NT Stonewash

• RT Jeans

Page 19: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 19

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

OntologyA Specification of a Conceptualization

Page 20: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 20

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Ontology: A Specification of Conceptualization

• Derivate of artificial intelligence (Logical inferencing)

• “… a formal explicit description of concepts in a domain ofdiscourse (classes (sometimes called concepts)), properties ofeach concept describing various features and attributes of theconcept (slots (sometimes called roles or properties)), andrestrictions on slots (facets (sometimes called role restrictions)).”

• There is no one correct way to describe a domain.

• A model for describing the world that consists of a set of topics,properties, and relationship types.

• Fixed versus open vocabularies.

Page 21: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 21

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Faceted ClassificationAnalysis and Synthesis

Page 22: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 22

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Faceted Classification: The Elephant

Page 23: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 23

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Faceted Classification: Analysis and Synthesis

• S.R. Ranganathan (1892-1972)

• Facet: ‘a clearly defined, mutually exclusive, andcollective exhaustive aspects, properties orcharacteristics of a class or specific subject.’

• Describing documents from various perspectives.

• A special purpose controlled vocabulary.

Page 24: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 24

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

eXchangable Faceted Metadata Language

• A language to exchange metadata between websites.

• XFML Core aka XFML 1.0 (Peter van Dijck et al. 2002)

• Categories, subcategories, and faceted metadata.

• Open XML format for publishing and connecting faceted metadataof websites.

• An XFML file contains TOPICS, organized in FACETS.

• Effectively seperating navigation from content.

See also: xfml.org

Page 25: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 25

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

XFML: Example<?xml version="1.0"?>

<xfml version="1.0" url="http://xfml.org/spec/example.xml" language="en-us">

<mapInfo>

<managingEditor>

<name>Peter Van Dijck</name>

<email>[email protected]</email><url>http://petervandijck.net</url>

</managingEditor>

</mapInfo>

<facet id="place_to_go">places to go</facet>

<!-- TOPICS -->

<topic id="bogota" facetid="place_to_go" parentTopicid="colombia">

<name>Bogota</name>

</topic>

<page url="http://poorbuthappy.com/colombia/topics.php">

<title>Guide to Colombia topics page</title>

<occurrence topicid="diving"/>

<occurrence topicid="bogota"/>

</page>

</xfml>

Page 26: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 26

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Topic MapsThe GPS of the Information Universe

Page 27: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 27

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Topic Maps: The GPS of the Info Universe

• “The purpose of a topic map is to convey knowledge aboutresources through a superimposed layer, or map, of the resources.A topic map captures the subjects of which resources speak, and therelationships between subjects, in a way that is implementation-independent.”

• A model to describe knowledge structures.

• A topic map is a data structure.

• Key concepts: (typed) Topics, Associations, and Occurences.

• Topic Maps can represents controlled vocabularies, taxonomies,thesauri, and faceted classification.

• XML Topic Maps 1.0 (valid XML)See also: topicmaps.org

Page 28: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 28

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Topic Map: Example (Garshol)

Page 29: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 29

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

RDFResource Description Framework

Page 30: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 30

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

RDF: Resource Description Framework

• Alternative to Topic Maps.

• R.V. Guha @Apple Meta Content Framework.

• A framework for representing information on the Web.

• XML app -> W3C Recommendation (1999) for the expression ofany kind of target.

• Key concepts: Resources, Properties, and Statements.

Page 31: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 31

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

The Semantic WebA Vision of the Future

Page 32: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 32

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

SemWeb: A Vision of the Future

• “The Semantic Web is an extension of the current web in whichinformation is given well-defined meaning, better enablingcomputers and people to work in cooperation.” - TBL

• Vision: Well-defined data on the Web that can be used by machinesfor automation, integration and re-use.

• The Web can reach its full potential: data to be shared andprocessed by automatic tools.

• Based upon RDF and theWeb Onology Language

See also: semanticweb.org

Page 33: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 33

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

Discussion

? & !

Page 34: Metadata and XML - BogieLand · 10 november 2004 Metadata and XML 1 ... Jeans • BT Pants • NT Levis ... • An XFML file contains TOPICS, organized in FACETS.

10 november 2004 Metadata and XML 34

InfoDesign: Understanding by Design Peter J. Bogaardswww.bogieland.com

BogieLandinformation design & information architecture

http://www.bogieland.com

Peter J. [email protected]