What we want with web- archives: will we win? Kevin Ashley ULCC Digital Archives Department http://dablog.ulcc.ac.uk/ W8.0
Jan 26, 2015
What we want with web-archives: will we win?
Kevin Ashley
ULCC Digital Archives Department
http://dablog.ulcc.ac.uk/
W8.0
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
2
Past histories
• Tom Standage – The Victorian Internet
• Not just what was said, but how
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
3
http://vimeo.com/2312662
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
4
Thinking about use cases
• Not just document-centred
• Content
• Properties of content
• The web of data
• The web as data
• Stuff about the web as well as from the web
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
5
Document-centred is useful
• For many academic uses, still central
• Sometimes content, sometimes presentation, sometimes both
• Timeslices or places over time:
• Brian Kelly's history of University of Bath homepage
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
6
Content in aggregate
• Textual analysis
• Contrasting use of language
• Tracking spread of neologisms
• Word clouds
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
7
Properties of content
• How quickly was PNG adopted ?
• Was takeup uniform in countries, types of site ?
• What did it replace ?
• What happened to XPM ?
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
8
Searching the past
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
9
The web as data
Hidekazu Shiozawa and Yutaka Matsushita – “Natto”
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
10
The web of data
• Linked data:
“a term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web”
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
11
http://taggalaxy.de/
APIs that allow alternate views
• Archives collect, protect and provide permanent references for content
• APIs allow many views and uses to emerge
• They permit intelligent intermediaries to do our work, or to assist
• Important as archive space fragments
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
12
Other stuff on or about the web
• Traditional media about the web
• Usage logs, server configs, server software
• Browsers, plugins, validators, …
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
13
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
14
2009-07-21 Kevin Ashley: http://dablog.ulcc.ac.uk/
15
Thanks to Martin Dodge’s cyber-geography pages