Top Banner
An exercise in preservation and applied technology Making an Electronic Text
8

An exercise in preservation and applied technology Making an Electronic Text.

Dec 13, 2015

Download

Documents

Cameron Lang
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: An exercise in preservation and applied technology Making an Electronic Text.

An exercise in preservation and applied technology

Making an Electronic Text

Page 2: An exercise in preservation and applied technology Making an Electronic Text.

Published in 1871only 456 copies printed This book is a collection of broadsides, ballads, and popular stories in Dickensian London

Charles Hindley’s Curiosities of Street Literature

Page 3: An exercise in preservation and applied technology Making an Electronic Text.

Using High quality scanned images and OCR software we have created text documents from the scanned images Using XML we are then able to “Mark-up” the documents for display on the web. We are following a defined standard for electronic texts. The TEI, or Text Encoding Initiative.

What we are doing

Page 4: An exercise in preservation and applied technology Making an Electronic Text.

This standard was defined by the University of Oxford, Brown University, University of Bergen, and the University of VirginiaTEI consortium formulated their guidelines to facilitate interchange between individuals and groups using different programs and computer systems over a broad range of applications

Text Encoding Initiative

Page 5: An exercise in preservation and applied technology Making an Electronic Text.

To make the TEI defined documents as accessible as possible a cross platform mark-up language was chosenA mark-up language can be as simple as HTML (Hyper Text Mark-up Language) As complex as LaTeXAs user definable as XML (eXtensible Mark-up Language)

Page 6: An exercise in preservation and applied technology Making an Electronic Text.

eXtensible Mark-up LanguageChosen By TEI for it’s cross platform, multi-application capabilities.The user defines the mark-up in XMLcustom tag and search XML documents based on those tags

XMLWhy it’s good for you

Page 7: An exercise in preservation and applied technology Making an Electronic Text.

Each image, scanned saves as a 40 Megabyte uncompressed TIFF Using OCR (optical character recognition) software, we are able to preserve the text.

The Images

Page 8: An exercise in preservation and applied technology Making an Electronic Text.

Once the image has been OCR’ed, a text document is createdthese text documents can then be marked up in XMLMarkup can be done is software or manually

The Text