TECHNOLOGY SUPPORT FOR ESSSS Progress, Issues, and Challenges Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library Founder and Publisher, Library Technology Guides http://www.librarytechnology.org/ http://twitter.com/mbreeding ESSSS Digital Archive Workshop February 4, 2012
Technology Support for ESSSS. Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library Founder and Publisher, Library Technology Guides http://www.librarytechnology.org/ http://twitter.com/mbreeding. Progress, Issues, and Challenges. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
TECHNOLOGY SUPPORT FOR ESSSS
Progress, Issues, and Challenges
Marshall BreedingDirector for Innovative Technology and ResearchVanderbilt University LibraryFounder and Publisher, Library Technology Guideshttp://www.librarytechnology.org/http://twitter.com/mbreedingESSSS Digital Archive WorkshopFebruary 4, 2012
The contents of the page images contain valuable data
Page images can be read by humans but do not support essential features: search, computer analysis, etc.
Full value of these collections can be realized through transcription
Challenges in transcription
Page characteristics Hand written by many different hands Many names and numbers Spanish language Varying contrast Many defects: water damage, insects, etc
Human transcription
Scholars that work with pages of interest can create transcriptions manually
Optical character recognition? Highly accurate for typescript Not effective for handwritten manuscripts
Crowdsourcing
Find ways to have large numbers of persons create transcript snippets
Google uses crowdsourcing to improve transcripts for Google Books project.
Google ReCAPTCHA:
“Digitizing books one word at a time” Each transaction transcribes one or two
words Each word is transcribed many times Results compared to determine correct
version
Google ReCAPTCHA
Crowdsourcing to Transcribe ESSSS Scholars contribute any transcriptions
created as they work with any given set of pages
Students assigned to create transcriptions Language, history, LIS
Collaboration with some organization with ReCAPTCHA like infrastructure