Top Banner
Content Working Group Archive-It Partner Meeting November 18, 2014 Barrage balloon manufacture... ” by Alfred T. Palmer under public domain Nicholas Taylor (@ nullhandle) Web Archiving Service Manager Stanford University Libraries A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report
35

A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Jul 12, 2015

Download

Internet

nullhandle
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 2: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

NDSA Web Archiving Survey Working Group

Jefferson BaileyInternet Archive / Archive-It

Kristine HannaInternet Archive / Archive-It

Edward McCainUniversity of Missouri

Cathy HartmanUniversity of North Texas

Abbie GrotkeLibrary of Congress

Christie MoffattNational Library of Medicine

Nicholas TaylorStanford University

Page 3: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

NDSA Web Archiving survey background

2011

• 78 respondents

• program info

• tools and services

• access

• policies

2013• 92 respondents• program info

• staff time, metrics, skills, content concerns

• tools and services• access and discovery

• new discovery options

• policies• embargo, social media,

robots.txt, resources

Page 6: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

universities still make up most programs

College or University

47%

Archive13%

State Gov13%

Other12%

Fed Gov8%

Commercial2%

Public Library

2%Museum

3%

2011

College or University

52%Archive

15%

State Gov13%

Other8%

Fed Gov5%

Commercial4%

Public Library

2%Museum

1%

2013

Page 9: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

web/archiving tech savviness are key skills

39% 37%

24%21% 21%

10%6% 6%

0%5%

10%15%20%25%30%35%40%45%

Percentage of organizations

Page 10: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

data volume and archive use are key metrics

53%

47%

22%20%

8%4% 4%

0%

10%

20%

30%

40%

50%

60%

Volume Usage Cost Quality Buy-in Loss Policy

Percentage of organizations

Page 12: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

programs have matured slightly since 2011

64%

16% 17%

4%

72%

14%9%

2%

0%

10%

20%

30%

40%

50%

60%

70%

80%

Active Testing Planning No longer collecting

2011 2013

Page 13: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

strong perceptions of progress since 2011

Significant progress40%

Some progress36%

About the same20%

Slightly worse off2%

Much worse off2%

Page 14: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

many new programs since 2011

10

3

0

21

2

0

23

8

65

4

67

12

19

0

2

4

6

8

10

12

14

16

18

20

1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013

Number of organizations

Page 15: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

two-thirds of them now use Archive-It

0 0 1 02

0 1 0 1 0

3 31 2

42

64

1 0

2

0

0

11

0

1 3

53

4 2

2 5

6

15

0

2

4

6

8

10

12

14

16

18

20

1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013

Number of organizations Archive-It Partner as of 2013

Page 17: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

more programs are only self-archiving

31%

49%

20%

15%

48%

37%

0%

10%

20%

30%

40%

50%

60%

Archive other sites only Archive both Archive own site only

2011 2013

Page 18: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

concern about social media, databases, video

6965 64

49

40

32

16

0

10

20

30

40

50

60

70

80

Social Media Databases Video InteractiveMedia

Audio Blogs Art

Number of organizations

Page 19: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

untapped interest in collaboration

21%

72%

7%

17%

47%

33%

2%

0%

10%

20%

30%

40%

50%

60%

70%

80%

Yes No Not yet, but interested Don't know

2011 2013

Page 21: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

web archiving as a service still most popular

60%

25%

14%

63%

20%16%

0%

10%

20%

30%

40%

50%

60%

70%

External In-house Both

2011 2013

Page 22: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

data not transferred from service provider

19%

81%

20%

80%

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

Transferred Haven't transferred

2011 2013

Page 23: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

increased use of tools supporting W/ARC

24%

76%

38%

62%

0%

10%

20%

30%

40%

50%

60%

70%

80%

Supports W/ARC Doesn't support W/ARC

2011 2013

Page 24: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

less granular descriptive metadata

62% 66%

47%55%

30%36%

54%60%

43%50%

22%18% 20%

5%

20%

0%

10%

20%

30%

40%

50%

60%

70%

2011 2013

Page 26: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

most don’t notify or seek permission

42 4245

17

7

1114 13

15

0

5

10

15

20

25

30

35

40

45

50

Capture Provide restricted access Provide public access

No action Notify Request permission

Page 27: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

more conditional handling of robots.txt

38%33%

8%

21%22%

55%

8%

16%

0%

10%

20%

30%

40%

50%

60%

Always respect robots.txt Sometimes/conditionallyrespect robots.txt

Never respect robots.txt Don't know

2011 2013

Page 28: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

social media archiving policies are uncommon

Has social media archiving policy

24%

Lacks social media archiving policy

76%

Page 29: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

policies based on community practices

54%

40%

25%

11%

5% 5% 7%

0%

10%

20%

30%

40%

50%

60%

Otherorganizations

ARL Code ofBest Practices

Section 108Study Group

Counsel orservice provider

Oakland ArchivePolicy

Statute Don't know

Percentage of organizations

Page 31: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

profile of the average survey respondent

• university archive

• started in last three years

• Archive-It user

• ¼ FTE web-savvy archivist

• concerned w/ content capture, cost, and use

• broad level of description

• ambivalent about collaboration

“Container” by Glyn Lowe under CC BY 2.0

Page 32: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

maturity and convergence

• maturity• 75% cite some or significant progress since 2011

• 38% started programs since 2011

• 8% more programs in active status since 2011

• convergence• 79% using external service providers

• 81% devoting ½ FTE or less to web archiving

• 67% rely on community practices for policy-making

• 13% more using Wayback since 2011

Page 33: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

challenges and opportunities

• challenges• 53% concerned about data volume growth

• 47% concerned about fostering access

• more than 73% concerned about content capture

• opportunities• 33% interested but not yet involved in collaborations

• 76% lack social media archiving policies

• less than 23% of archived materials are described

Page 34: A Snapshot of the U.S. Web Archiving Landscape through the 2013 NDSA Survey Report

Content Working Group

implications and questions

• implications• web archiving not (yet) a top institutional priority

• demand for ongoing Archive-It technical investment

• U.S. web archiving landscape is changing quickly

• questions• how to build institutional support?

• collaboration with whom and on what?

• what’s not being archived?

• how well are we curating what we do archive?