NetarchiveSuite Workshop, November 24, 2011, Paris. Web@rchive Austria: Using Wayback for Access and QA. Andreas P., Austrian National Library, webarchiv@onb.ac.at, http://webarchiv.onb.ac.at

Jan 05, 2016


Claire Benson
Transcript

NetarchiveSuite Workshop, November 24, 2011, Paris

Web@rchive Austria
Using Wayback for Access and QA

Andreas P.

Austrian National Library
webarchiv@onb.ac.at
http://webarchiv.onb.ac.at


Wayback for Access

• Currently using version 1.4.2
• Wayback running on clients without Internet access (no online/offline content mixing)
• Archival URL replay mode
• Default JSP customized to the corporate design of the library
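As a rough illustration of Archival URL replay mode: the replay URL carries a 14-digit capture timestamp followed by the original URL, and links inside replayed pages are rewritten to the same form, which helps keep browsing inside the archive. The sketch below builds and splits such URLs. The prefix `http://wayback.example.org/wayback/` and both function names are assumptions made for illustration, not values from the talk.

```python
from datetime import datetime

# Hypothetical replay prefix; the real host and path depend on the local Wayback deployment.
WAYBACK_PREFIX = "http://wayback.example.org/wayback/"


def archival_url(original_url: str, captured_at: datetime,
                 prefix: str = WAYBACK_PREFIX) -> str:
    """Build a replay URL of the form <prefix><yyyyMMddHHmmss>/<original URL>."""
    timestamp = captured_at.strftime("%Y%m%d%H%M%S")
    return f"{prefix}{timestamp}/{original_url}"


def split_archival_url(replay_url: str,
                       prefix: str = WAYBACK_PREFIX) -> tuple[datetime, str]:
    """Recover the capture time and the original URL from a replay URL."""
    if not replay_url.startswith(prefix):
        raise ValueError("not a replay URL under this prefix")
    # Everything up to the first slash after the prefix is the timestamp.
    timestamp, _, original_url = replay_url[len(prefix):].partition("/")
    return datetime.strptime(timestamp, "%Y%m%d%H%M%S"), original_url
```

For example, `archival_url("http://www.onb.ac.at/", datetime(2011, 11, 24, 9, 30))` yields a URL ending in `20111124093000/http://www.onb.ac.at/`.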


Wayback for Access


Wayback for Access

• Currently using version 1.4.2
• Wayback running on clients without Internet access (no online/offline content mixing)
• Archival URL replay mode
• Default JSP customized to the corporate design of the library
• Pre-search for hostnames (stored in a DB, will be replaced by a Solr full-text index over all object URLs)


Wayback for Access

• Pre-search for hostnames with SQL

• Auto suggest implemented using Ajax Auto Suggest 2.1.3 [1]

• Screencast on http://www.screenr.com/AZ0

[1] http://www.brandspankingnew.net/specials/ajax_autosuggest/ajax_autosuggest_autocomplete.html
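A minimal sketch of what a hostname pre-search backing an auto-suggest field could look like. The slides do not show the schema or the query, so the `hostnames` table, the prefix match, and the function names are all assumptions; SQLite stands in for whatever database is actually used.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a hypothetical hostnames table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE hostnames (hostname TEXT PRIMARY KEY)")
    conn.executemany(
        "INSERT INTO hostnames (hostname) VALUES (?)",
        [("www.onb.ac.at",), ("webarchiv.onb.ac.at",), ("www.example.at",)],
    )
    return conn


def suggest_hostnames(conn: sqlite3.Connection, typed: str, limit: int = 10) -> list[str]:
    """Return up to `limit` hostnames starting with what the user has typed."""
    # Escape LIKE wildcards so user input is matched literally.
    escaped = typed.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    rows = conn.execute(
        "SELECT hostname FROM hostnames "
        "WHERE hostname LIKE ? ESCAPE '\\' "
        "ORDER BY hostname LIMIT ?",
        (escaped + "%", limit),
    ).fetchall()
    return [row[0] for row in rows]
```

The Ajax widget would call a small server-side endpoint that runs a query like this on each keystroke and returns the matches; that endpoint is not shown here.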


Wayback for Access

• Screencast on http://www.screenr.com/mZ0


Wayback for Access

• Currently using version 1.4.2
• Wayback running on clients without Internet access (no online/offline content mixing)
• Archival URL replay mode
• Default JSP customized to the corporate design of the library
• Pre-search for hostnames (stored in a DB, will be replaced by a Solr full-text index over all object URLs)
• Displays thumbnails of search results (pre-generated, not on the fly)


Wayback for Access

• Thumbnails pre-generated with CutyCapt [1]

[1] http://cutycapt.sourceforge.net/
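A possible shape for the pre-generation step, sketched under assumptions: the command line uses CutyCapt's `--url`, `--out`, `--min-width` and `--delay` options, while the file naming scheme (SHA-1 of the replay URL), the default values and the function names are illustrative only and not taken from the talk.

```python
import hashlib
from pathlib import Path


def thumbnail_path(replay_url: str, out_dir: Path) -> Path:
    """Derive a stable file name for a capture (hypothetical naming scheme)."""
    digest = hashlib.sha1(replay_url.encode("utf-8")).hexdigest()
    return out_dir / f"{digest}.png"


def cutycapt_command(replay_url: str, out_file: Path,
                     min_width: int = 1024, delay_ms: int = 2000) -> list[str]:
    """Build the argument list for one CutyCapt screenshot run."""
    return [
        "CutyCapt",
        f"--url={replay_url}",
        f"--out={out_file}",
        f"--min-width={min_width}",  # viewport width in pixels
        f"--delay={delay_ms}",       # wait after load before capturing, in ms
    ]
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` in a batch job. CutyCapt renders the full page, so scaling the image down to thumbnail size would be a separate step that is not shown here.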


Wayback for Access

• Using Wayback version 1.4.2
• Wayback running on clients without Internet access (no online/offline content mixing)
• Archival URL replay mode
• Default JSP customized to the corporate design of the library
• Pre-search for hostnames (stored in a DB, will be replaced by a Solr full-text index over all object URLs)
• Displays thumbnails of search results (pre-generated, not on the fly)
• Using Web Archive Access Control to block pages for a period


Wayback for Access

• Access control using Web Archive Access Control [1]

[1] https://webarchive.jira.com/wiki/display/wayback/Exclusions+API
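To make "block pages for a period" concrete, here is a minimal sketch of a time-limited block rule. It does not use or reproduce the Exclusions API linked above; the `BlockRule` class, its fields and the plain URL-prefix matching are hypothetical and chosen only for brevity.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class BlockRule:
    """Hypothetical rule: hide everything under a URL prefix for a period."""
    url_prefix: str
    blocked_from: date
    blocked_until: Optional[date]  # None means the block is open-ended

    def applies(self, url: str, on: date) -> bool:
        if not url.startswith(self.url_prefix):
            return False
        if on < self.blocked_from:
            return False
        return self.blocked_until is None or on <= self.blocked_until


def is_blocked(url: str, on: date, rules: list[BlockRule]) -> bool:
    """A page is blocked if any rule covers it on the day of access."""
    return any(rule.applies(url, on) for rule in rules)
```

With a rule such as `BlockRule("http://example.at/private/", date(2011, 1, 1), date(2011, 12, 31))`, a request on the workshop date would be refused, and the same page would become accessible again from January 1, 2012.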


Wayback for Access


Wayback for QA

• Define time range and crawl/harvest definition for QA


Wayback for QA

• Selecting scope for QA


Wayback for QA

• Define time range and crawl/harvest definition for QA

• QA tool copies files to the QA server and builds an index for the Wayback Machine


Wayback for QA

• Copying ARC files and building the index for Wayback
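The selection and copy step might look roughly like the sketch below. It assumes ARC files are named `<jobID>-<harvestID>-<timestamp>-<serial>-<host>.arc` with a 14-digit timestamp; that pattern, the directory layout and the function names are assumptions for illustration. The QA tool itself is not shown in the slides, and building the Wayback index is left out.

```python
import re
import shutil
from datetime import datetime
from pathlib import Path

# Assumed file name pattern: <jobID>-<harvestID>-<yyyyMMddHHmmss>-<serial>-<host>.arc[.gz]
ARC_NAME = re.compile(
    r"^(?P<job>\d+)-(?P<harvest>\d+)-(?P<ts>\d{14})-\d+-.+\.arc(\.gz)?$"
)


def select_arc_files(src_dir: Path, harvest_id: int,
                     start: datetime, end: datetime) -> list[Path]:
    """Pick the ARC files of one harvest definition created within [start, end]."""
    selected = []
    for path in sorted(src_dir.iterdir()):
        match = ARC_NAME.match(path.name)
        if match is None or int(match.group("harvest")) != harvest_id:
            continue
        created = datetime.strptime(match.group("ts"), "%Y%m%d%H%M%S")
        if start <= created <= end:
            selected.append(path)
    return selected


def copy_to_qa(files: list[Path], qa_dir: Path) -> list[Path]:
    """Copy the selected files into the QA server's ARC directory."""
    qa_dir.mkdir(parents=True, exist_ok=True)
    return [Path(shutil.copy2(f, qa_dir)) for f in files]
```

After the copy, the QA Wayback instance would need its index rebuilt over the files in `qa_dir` before the seeds can be browsed.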


Wayback for QA

• Define time range and crawl/harvest definition for QA

• QA tool copies files to the QA server and builds an index for the Wayback Machine

• Browse seeds within the selected time range


Wayback for QA

• Browse seeds

• Copying ARC files and building the index for Wayback


Wayback for QA

• Define time range and crawl/harvest definition for QA

• QA tool copies files to the QA server and builds an index for the Wayback Machine

• Browse seeds within the selected time range


More information:
http://webarchiv.onb.ac.at

Social media:
http://twitter.com/AT_Webarchive
http://www.facebook.com/ATWebarchive
http://www.slideshare.net/ATWebarchive
http://screenr.com/user/AT_Webarchive

Questions?