NetarchiveSuite Workshop, November 24, 2011, Paris 1 Web@rchive Austria Using Wayback for Access and QA Andreas P. Austrian National Library webarchiv@onb.ac.at.

Post on 05-Jan-2016

217 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

Transcript

NetarchiveSuite Workshop, November 24, 2011, Paris 1

Web@rchive AustriaUsing Wayback for Access and QA

Andreas P.

Austrian National Librarywebarchiv@onb.ac.athttp://webarchiv.onb.ac.at

NetarchiveSuite Workshop, November 24, 2011, Paris 2

Wayback for Access

• Currently using Version 1.4.2• Wayback running on clients without Internet Access –

no online/offline content mixing• Archival URL Replay Mode• Default Jsp customized to corporate design of the

library

NetarchiveSuite Workshop, November 24, 2011, Paris 3

Wayback for Access

NetarchiveSuite Workshop, November 24, 2011, Paris 4

Wayback for Access

• Currently using Version 1.4.2• Wayback running on clients without Internet Access –

no online/offline content mixing• Archival URL Replay Mode• Default Jsp customized to corporate design of the

library• Pre-search for Hostnames (stored in DB, will be

replaced by Solr-Fulltextindex over all Objecturls)

NetarchiveSuite Workshop, November 24, 2011, Paris 5

Wayback for Access

• Pre-search for Hostnames with Sql

• Auto Suggest implemented by using Ajax Auto Suggest 2.1.3[1]

• Screencast on http://www.screenr.com/AZ0

[1]http://www.brandspankingnew.net/specials/ajax_autosuggest/ajax_autosuggest_autocomplete.html

NetarchiveSuite Workshop, November 24, 2011, Paris 6

Wayback for Access

• Screencast on http://www.screenr.com/mZ0

NetarchiveSuite Workshop, November 24, 2011, Paris 7

Wayback for Access

• Currently using Version 1.4.2• Wayback running on clients without Internet Access –

no online/offline content mixing• Archival URL Replay Mode• Default Jsp customized to corporate design of the

library• Pre-search for Hostnames (stored in DB, will be

replaced by Solr-Fulltextindex over all Objecturls)• Displays Thumbnails of search results (pre-

generated, not on the fly)

NetarchiveSuite Workshop, November 24, 2011, Paris 8

Wayback for Access

[1] http://cutycapt.sourceforge.net/

• Thumnbails pre-generated with CutyCapt[1]

NetarchiveSuite Workshop, November 24, 2011, Paris 9

Wayback for Access

• Using Version Wayback 1.4.2• Wayback running on clients without Internet Access –

no online/offline content mixing• Archival URL Replay Mode• Default Jsp customized to corporate design of the

library• Pre-search for Hostnames (stored in DB, will be

replaced by Solr-Fulltextindex over all Objecturls)• Displays Thumbnails of search results (pre-

generated, not on the fly)• Using Web Archive Access Control to block pages

for a period

NetarchiveSuite Workshop, November 24, 2011, Paris 10

Wayback for Access

• Access Control by using Web Archive Access Control[1]

[1]https://webarchive.jira.com/wiki/display/wayback/Exclusions+API

NetarchiveSuite Workshop, November 24, 2011, Paris 11

Wayback for Access

NetarchiveSuite Workshop, November 24, 2011, Paris 12

Wayback for QA

• Define Time-Range and Crawl/Harvestdefinition for QA

NetarchiveSuite Workshop, November 24, 2011, Paris 13

Wayback for QA

• Selecting scope for QA

NetarchiveSuite Workshop, November 24, 2011, Paris 14

Wayback for QA

• Define Time-Range and Crawl/Harvestdefinition for QA

• QA-Tool copies files to QA-Server and builds index for Wayback Machine

NetarchiveSuite Workshop, November 24, 2011, Paris 15

Wayback for QA

• Copying Arc-Files and building index for Wayback

NetarchiveSuite Workshop, November 24, 2011, Paris 16

Wayback for QA

• Define Time-Range and Crawl/Harvestdefinition for QA

• QA-Tool copies files to QA-Server and builds index for Wayback Machine

• Browse seeds within selected Time Range

NetarchiveSuite Workshop, November 24, 2011, Paris 17

Wayback for QA

• Browse Seeds Copying Arc-Files and building index for Wayback

NetarchiveSuite Workshop, November 24, 2011, Paris 18

Wayback for QA

• Define Time-Range and Crawl/Harvestdefinition for QA

• QA-Tool copies files to QA-Server and builds index for Wayback Machine

• Browse seeds within selected Time Range

top related