Webzine Technologies Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by Resource: The Council for Museums, Archives and Libraries, the Joint Information Systems Committee (JISC) of the Higher and Further Education Funding Councils, as well as by project funding from the JISC and the European Union. UKOLN also receives support from the University of Bath where it is based. Email [email protected]URL http://www.ukoln.ac.uk/ Acknowledgments to Marieke Napier, UKOLN who contributed to these slides
71
Embed
Webzine Technologies Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by Resource: The Council for Museums, Archives and.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Webzine Technologies
Brian Kelly
UK Web Focus
UKOLN
University of Bath
Bath, BA2 7AY
UKOLN is funded by Resource: The Council for Museums, Archives and Libraries, the Joint Information Systems Committee (JISC) of the Higher and Further Education Funding Councils, as well as by project funding from the JISC and the European Union. UKOLN also receives support from the University of Bath where it is based.
• Webzine will refer to an online (or Web) publication• Webzines can include:
Online journals (Web journals) – peer-reviewed Online / Web magazines – not peer-reviewed – focus of
this talk
• Some use term Webzine to refer to online fanzines
NOTE: A pragmatic definition for this seminar only - which focuses on technical and not philosophical issues!
See <http://www.stm-assoc.org/intconta/aaas/recommend.htm> for a more rigourous proposal and <http://www.issn.org/brochure/The_ISSN.htm> for a discussion about serials and ISSN
NOTE: Stevan Harnad will cover eprint archives next month
NOTE: A pragmatic definition for this seminar only - which focuses on technical and not philosophical issues!
See <http://www.stm-assoc.org/intconta/aaas/recommend.htm> for a more rigourous proposal and <http://www.issn.org/brochure/The_ISSN.htm> for a discussion about serials and ISSN
NOTE: Stevan Harnad will cover eprint archives next month
13
Why Do it?
Reasons to provide a Webzine:• Accessibility
• To a wider audience • Worldwide• Reduction of cost?• Functionality
• Fewer limits on article size • Image (you’re progressive in IT)• …
UKOLN also uses Webzines to provide a testbed for our research interests
14
The Negative Aspects
• Resourcing• Financial implications• Loss of some audience• May require a change of culture• Loss of control of information• Development of technical architecture
(addressed here)
15
The Reader’s View
What might a reader expect from our Web publication?
What approaches can we, as publishers, take to satisfy these requirements?
16
What Does the Reader Want?
Well-written, interesting, informative, relevant article Article is easy to find
Publication is easy to find
Article is easy to read
Article is accessible
No hype
Can comment on article
Can contact author
Can read comments from others
Easy printing
Issues:Are these requirements the same as for the author?How easy is it for the publisher to implement?
Issues:Are these requirements the same as for the author?How easy is it for the publisher to implement?
Content is king:• Useful to have a quality publication already• Need to encourage and nurture your authors• Can you use a stick?• Think about syndication and reuse of existing
content
18
Add VarietyA publication full of factual articles can be boring. Why not add a variety of content?
As well as adding variety, games, crosswords, etc this can allow the publisher to evaluate new technologies such as DHTML (scramble game), Java (crossword), user-agent-driven links to technologies (both), etc.
“Having had a look around at several web sites and e-journals, we found that you have managed to combine simplicity, elegance, completeness and advanced technical backing.”Susan Leech O'Neale, CERN, Sep 99 http://www.exploit-lib.org/issue5/indicators/http://www.exploit-lib.org/issue5/indicators/
New Reader FunctionsIdeas for new functionality for the readers:
• Automated news feeds (summaries of articles) which can be embedded in third party Web sites
• Summary information available using WAP phones [to evaluate ease of conversion of other XML types]
• Email access to newsletter • View similar articles (using search and metadata)• Personalised interface• VRML, simulation, …
New author / editorial board functions could also be added:• If cookie=“editor” display validation checks• If cookie=“author” display statistics
36
The Author’s View
What might an author expect from our Web publication?
What approaches can we, as publishers, take to satisfying these requirements?
37
What Does the Author Want?
To provide everything the reader needs:(quality article, easy-to-find, annotation, etc)
A simple publication process
Stats on access to article
A sophisticated publication process
Value added services from editorial processes
Kudos (and hits) through proximity to others
Submission to search engines
Metadata and advanced searching
Search engine friendly Web site
A “permanent” location for the article
For the article to have an “impact”
An attractive, usable designThe Author
38
GuidelinesIt is important to provide guidelines for authors:
• Background about the Webzine (aims, target audience, etc.)
• Requirements for article• Submission procedures• Editorial control and
conflict resolution• Use of images• Copyright
http://www.cultivate-int.org/author-guidelines/
http://www.cultivate-int.org/author-guidelines/
Encourage authors to make use of photographic digitisation services such as <http://www.fotango.com/> (e.g. photos of conferences) and clip art collections?
Encourage authors to make use of photographic digitisation services such as <http://www.fotango.com/> (e.g. photos of conferences) and clip art collections?
39
Upload ProcessesIt is desirable to enhance the processes for submission and processing of articles submitted by authorsCurrently:
• Files emailed to editor alias (which has a Web archive)We’d like:
• Web-based upload area, with author supplying metadata in standard format
• Automated MS Word -> HTML fragment conversion• Publish in pre-publication
area with automated link creation
Can we do this with limited technical resources and budget? We need to exploreMS SiteServer features
Author Upload Area
File Browse
AbstractArticle Type: Feature News
Expiry date (news items):
40
Authoring IssuesAuthor
Wordformat
MS Word
HTML tool HTML
format
Email
FTP
HTML importfilter
HTMLformat
HTMLformat
Word / RTF HTMLconverter
QA
(fo
rmat
, p
roo
fin
g,
etc)
QA
(fo
rmat
, p
roo
fin
g,
etc)
Editor
HTML form
HTTP / WebDav
Need to avoid bottleneck in Word HTML processing and misuse of HTML templates
Need to avoid bottleneck in Word HTML processing and misuse of HTML templates
QA – ManualAutomated
QAdistributed ed. board
peer-reviewing
41
StatisticsAuthors may want immediate access to statistics for their article
The use of externally-hosted Web statistical services such as Nedstat can provide this with minimal work from the publisher
Total number of pageviews: 2434.
Top day so far: 30 January 2001 with 592 pageviews
Total number of pageviews: 2434.
Top day so far: 30 January 2001 with 592 pageviews
A server-side script (ASP) transforms this into the published version.
For citation purposes:Kelly, B. "What's Happened To My Slides: Giving Presentations at Conferences", Cultivate Interactive, issue 3, 29 January 2001URL: <http://www.cultivate-int.org/issue3/presentations/>
Brian KellyUK Web FocusUKOLNUniversity of BathBathEmail: [email protected] Phone : +44 1225 323943
The approach to providing citations helps us to be consistent
But what should be done when:• An author changes her name (e.g. marriage) or
status (becomes a Professor)?• An author’s email or postal addresses changes?
Ideally we should:• Store information in database and allow reader to
view original and latest detailsFor citation purposes:Kelly, B. "What's Happened To My Slides: Giving Presentations at Conferences", Cultivate Interactive, issue 3, 29 January 2001URL: <http://www.cultivate-int.org/issue3/presentations/>Author’s email address has changed: [View current information]
44
AnnotationJakob’s Neilson Alertbox is a valuable resource for Web developers.
Jakob provides annotation which allows him to add text to previously published articles, without changing the content of his original article
Little use of Hypernews to comment on Exploit Interactive articles Should we:
• Promote more • Have comments inline• Provide timed realtime chats (e.g HumanClick.com) • …
47
Find Article From AltaVista
Search engines can drive much traffic to a Webzine, esp. if articles contain unusual names (e.g. orgs such as CERN):
• Be proactive in ensuring Webzines are indexed by major search engine vendors
• Ensure that Webzines are indexed in a timely manner (just before new issue released)
• Provide a search engine friendly site map (e.g. issue or volume table of contents) containing lists to all (all new) articles and submit it to search engines
48
Tools
Many tools are available for submitting Web sites to search engines, including desktop applications and Web services
NOTE:• Submit the URL of a
page with links to all pages (e.g. table of contents, site map, What’s New page)
Informant sends regular emails with info on position of query in AltaVista
• Why is my Webzine not in top 10 for query?
• Should I resubmit?• Is the service incorrect?• Should I use another service?
Informant sends regular emails with info on position of query in AltaVista
• Why is my Webzine not in top 10 for query?
• Should I resubmit?• Is the service incorrect?• Should I use another service?
50
Avoid Junk Being Indexed
A search for your Webzine gives you:• The pre-release information, contained in the
editor’s personal home page • The pre-publication Web site
To avoid this happening make use of a robots.txt file and the Robot Exclusion Protocol (REP):
robots.txt
User-agent: *Disallow: /
Pre-release Web site
NOTE:Think about this from the startOnce a page is indexed, it’s difficult to get it removed
You can also control robots by using the <meta link=“robots” content=“noindex”> tag in HTML pages (e.g. don’t index news pages)
51
The Publisher’s View
What might a publisher wish to provide for its Web publication?
What approaches can we, as publishers, take to satisfying these requirements?
52
Statistics
An analysis of Web statistics is needed:• For reporting to funders and other stakeholders• To spot trends• To aid in dissemination• For server performance monitoring• To identify browser characteristics
But:• It can be addictive• It can be time-consuming• It can be misleading
For further info on Web statistics see article at <http://www.exploit-lib.org/issue5/indicators/>
For further info on Web statistics see article at <http://www.exploit-lib.org/issue5/indicators/>
53
Approaches to StatisticsCultivate Interactive uses 2 externally hosted statistical services:
News Feeds For Others To UseYou can provide news feeds from your Webzine:
• Additional dissemination• Remote site drives traffic to you
UKOLN has developed CGI and Javascript parsers for RSS news feeds:
• CGI version works for any browser, but use requires CGI privileges
• JavaScript version requires JS-support, but can be used by HTML author
UKOLN has developed CGI and Javascript parsers for RSS news feeds:
• CGI version works for any browser, but use requires CGI privileges
• JavaScript version requires JS-support, but can be used by HTML author
56
Article MaintenanceWeb sites deteriorate:
• Information becomes out-of-date• Hot news becomes stale news• Links start to break• Metadata changes (email and postal address of author, etc.)• New formats are introduced (HTML -> XHTML)• New functionality is introduced (automated translation)
What should be done if:• The links from an article become broken• The author’s email and postal address changes• An article now contains incorrect information• You wish to change the font used initially
Should:• They be fixed• They be left – don’t tamper with published information, as this
sets a dangerous precedent• …
57
What Is The Article?
A published article may be treated as an aggregation of resource fragments:
• Navigational fragments (header, footer, …)• Branding (name of publisher, funder, etc.)• Core content (the bit which would be printed in a
conventional publication)• Additional end user functionality (translate this
article, find similar articles, annotate article, …)• Publisher / author functionality (validate this article,..)• …
58
Permanent Record or Updated Resource?How do you regard an article published in a Webzine:
• A fixed record which should not be changed (unless, possibly, a formal update / reprint process is gone through)?
• A resource which can be updated according to pragmatic criteria (e.g. fixing errors, annotating when out-of-date, but not changing the original meaning)?
59
A Pragmatic ApproachThere is a need for a policy (and related procedures).
This will reflect the role of the Webzine.
How about:• The meaning of published information will not be changed once
an article has been published• References (hyperlinks) which become broken will be
annotated if they are fixed or removed. Original link information will be available to the reader.
• The look-and-feel of the Webzine may be changed retrospectively for published articles. If feasible, it should be possible for a reader to return to the original look-and-feel
• Changes may be made to HTML, etc. elements.• Changes made be made to the navigational and functional
fragments for an article
60
StandardsThe publisher should be interested in standards:
• To maximise potential readership• To provide long-term access to resources• To avoid application / platform dependencies
We use:• HTML (moving to XHTML, but issues over tools)• CSS (but issues over browser support)• Dublin Core metadata in HTML
We are thinking about:• Transforming XHTML into WML using XSLT• Providing DC metadata in RDF• Using RSS for news feeds• Other areas which inform our research activities
61
ArchitectureKey feature – use of neutral fragments which are transformed, easily managed and reused
TitleAuthorURLdate
HeaderFooterArticle body
TranslateSee also
DTDCSSHEAD
Various fragments are processed by server scripts and can be managed
Add new functionality:• Print all• Translate• See also
Manage functionality:• New translation• Update Web stats• Manage metadata
Add new format, etc:• PDF, WAP, eBook,
email, ...• Full list of authors• User-defined views
variables
content
HTML
Render article
Render issue
Createsite map
62
Applications
What applications can be used to implement this architecture?
Cultivate Interactive
MS SiteServerMainly used for indexing functionAlso provides site management toolsAuthor upload facility is of interest
HTML Authoring ToolsHotMetal / FrontPage / HTML Kit (under review)
Web StatsNedstats / Sitemeter (Web-based)Misc submission tools
Software Development (ASP Scripts)Visual InterdevNotepad
63
Applications – Licensed or Free?
Background:• Limited budget available for Cultivate Interactive• Limited resources for software development
(bought-in Postgraduate help)• Provided opportunity to evaluate / report on
potential for (free) Web-based services - ASPs (Application Service Providers)
Use of mainly free Web-based service:• Web statistics• Submission to search engines• Polling services• Monitoring service availability
Use of mainly free Web-based service:• Web statistics• Submission to search engines• Polling services• Monitoring service availability
64
Applications – A CMS
The model for Cultivate Interactive has worked wellFor a larger-scale project, a CMS (Content Management System) may be needed:
• Manage access by multiple editors• Manage access rights (design can change
appearance, editor can change content)• Manage workflow processes
• Manage application developmentIssues:
• Open source (e.g. Zope) vs. Licensed• Capital costs vs development costs• Entry point (size of Webzine, budget, …)• Support
author editor pre-release site review publication
65
IdentifiersPublisher should provide short memorable identifiersIt’s wise to avoid platform & application dependenciesIt’s also useful to use directories to group issues and articles
Longer, format specific, scope for confusion: .htm or .html
Also application-specific, possible mirroring / indexing problems
Very application specific, not indexable, caching and mirroring problems But use of a database helps with Web site maintenance
NOTE: database generated resource can have static URL
66
Mirroring / PreservationIf your Webzine is popular you be be asked if it can be mirrored (low bandwidth areas, or Intranets, ..)You may also wish to consider the longterm preservation of the WebzineIssues:
• Absolute vs relative URLs• The root of your Web site and its structure• Mirroring the content or the application• Sucking the Webzine vs pushing (replicating) it• Navigation on a mirrored site (where is home?)• Mirroring exactly vs rewriting bits
If a site can be mirrored easily, it’s more likely to be easily preserved or transferred to other platforms. So even if mirroring doesn’t appear
to be of interest, there may be other benefits
If a site can be mirrored easily, it’s more likely to be easily preserved or transferred to other platforms. So even if mirroring doesn’t appear
to be of interest, there may be other benefits
67
D-LibD-Lib is mirrored at UKOLN at <http://mirrored.ukoln.ac.uk/lis-journals/dlib/dlib/dlib.html>
NOTE:• Search and the results are
not at mirror • Long URL of UK mirror
68
Developments With IdentifiersLibrary professionals have an interest in persistent addressingURLs break when:
• An organisation is renamed
• A Web site is reorganised
Possible solutions include:
• DOIs (but mirroring issues)
• OpenURLs (address mirroring and resolution)
• …Area is of interest to UKOLN and JISC
See <http://www.w3.org/Provider/Style/URI> for background info
See <http://www.w3.org/Provider/Style/URI> for background info
69
Funding Issues
How is our Web publication to be funded?
What suggestions do you have?
70
Funding Options
Your Webzine could be funded:• In-house: it’s part of your job• From savings from replacing the print edition• Through advertising revenue
(see <http://www.ukoln.ac.uk/web-focus/events/conferences/ili-2001/advertising/>)
• Through subscription• From grants, project funds, etc.• …
Any comments on these options?Which is likely to be feasible for your
publication (and your culture)?
Any comments on these options?Which is likely to be feasible for your