Cambridge Venture Project Digital Preservation and Long Term Access Functionality Group 12: Yunghan Au, Rizik Kandalaft, Ming Kuang, Savita Nair Thursday, December 9, 2010 Disclaimer: This work has been undertaken as part of a student educational project and the material should be viewed in this context. The work does not constitute professional advice and no warranties are made regarding the information presented. The Authors, Cambridge Judge Business School and its Faculty do not accept any liability for the consequences of any action taken a result of the work or any recommendations made or inferred.
39
Embed
Cambridge Judge Business School, Market Research Digital Preservation
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Cambridge Venture ProjectDigital Preservation and Long Term Access Functionality
Group 12: Yunghan Au, Rizik Kandalaft, Ming Kuang, Savita NairThursday, December 9, 2010
Disclaimer: This work has been undertaken as part of a student educational project and the material should be viewed in this context. The work does not constitute professional advice and no warranties are made regarding the information presented. The Authors, Cambridge Judge Business School and its Faculty do not accept any liability for the consequences of any action taken a result of the work or any recommendations made or inferred.
Agenda
Introduction Digital Preservation Open Planets Foundation
Project Objectives
Research Methodology
Secondary Research
Primary Research: Healthcare Industry
Summary & Recommendations
?
Open Planets Foundation
Independent, not-for-profit organization, derived from EU funded Planets Project
Commitment to an open-source framework
Membership mainly Memory Organizations
Migration & emulation services to actively prevent data format obsolescence
Promotion of digital preservation practices for homes and businesses
Project Objectives
Investigate global demand for digital preservation Global market size > $1 Bn
Identify a target industry in the corporate market Healthcare
Primary research and analysis of Healthcare Industry 92% interested in long-term data access 80% estimate cost of data loss to be high
Consultancy interest in OPF offerings Interested if established as an industry standard
Market Size (1 of 2)
1 Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at: http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).2 EB = 1,024 PB; PB = 1,024 TB; TB = 1,024 GB3 Schofield, J. (2010) Google finally enters the online storage arena with a free 1GB. Available at: http://www.guardian.co.uk/technology/blog/2010/jan/12/google-docs-storage-cloud-gdrive (Accessed: 8 December 2010).
Key consideration: Transaction records Large fines and penalties for lost customer
information Regulations require record retention for only 3-
6 years "Financial services industry accounts for just
6% of the digital universe today and will fall to 3% by 2011."1
1Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at: http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).
69% healthcare organizations expect data volumes to grow1
Picture archiving and communication systems (PACS) EPR files Scanned documents, primarily PDF
UK Department of Health: “EPRs must not be destroyed, or deleted, for the foreseeable future”2
7/24/365 accessibility crucial
Potential size of market is uncertain Lifetime of EPR expected to be 125 years3
Appendix 3: Healthcare
1Research Diagnoses the Underlying Causes of Healthcare Data Upsurge, www.bridgeheadsoftware.com2Records Management, NHS Code of Practice, Part 2 (2nd Edition). Available at: http://www.dh.gov.uk/ (Accessed: 7 December 2010)3Scott, R.E. (2007) ‘e-Records in health – Preserving our future’, International Journal of Medical Informatics, 76 (2007), pp. 427-431. ScienceDirect [Online]. Available at: http://www.sciencedirect.com (Accessed: 4 November 2010).
Key consideration 1: Digital Earth observation data
High cost of lost data for future generations Climate change effects on Earth’s topography
No definition of retention time period Primarily dependent on government budgets
Short term administrations unlikely to invest in long term preservation
Understands the need for digital preservation, struggles to articulate value -> Alliance with OPF?
Appendix 5: Defense
Key consideration 2: Deployment cycle A useful life of 10 years or more are expected
from storage systems “Some military mission flight recorders (black
boxes) from the late 1970s are still in service”1
Data loss is not an option Irreplaceable data Highly sensitive applications
1Howard, C.E. (2009) ‘Secure information storage: military and aerospace equipment designers have a wealth of online options available to fill the ever-increasing need for secure data storage in the field’, Military and Aerospace Electronics, Volume 20, Issue 12. Factiva [Online]. Available at: http://global.factiva.com (Accessed: 4 November 2010).
Key consideration: 3D Seismic data Sensors monitor activity at exploration
point and transfer information to headquarters to enable effective decision making
Chevron accumulates 2 TBs a day1
Geological data set for an oil field = 200 TB1
1Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at: http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).
Key consideration: Product life cycle Long term use of product data is “hampered by
the ephemeral nature of CAD file formats and the applications that work with them”
Several applications used in different phases of the product lifecycle
Expected amount of data to be preserved* = 80 EB1
* For 10 years or more
1Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at: http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).
Chronopolis is a national center organization for the management, long-term preservation, and promulgation of national digital assets.
Chronopolis addresses this critical problem (digital information) and provide the service of long-term data preservation and access
Non-for-profit; membership
InApp
InApp provides reengineering & migration services to software and hardware. It helps customers reengineer legacy systems, upgrade current products to a new platform, architecture, OS and / or language.
InApp is committed to be a world class business solution provider delivering exceptional value to the customer leveraging on an organization culture that promotes innovation and excellence.
For-profit and offer service
ESA-ESRIN
The largest European earth observation (EO) data provider and operates as the reference European centre for EO payload data exploitation.
Many users are calling for the need to preserve the EO data and ensure long-term accessibility.
Non-for-profit; membership
EMCNo.1 company for data storage service and start researching on preservation
Offer the best service on data storage and access
For-profit and offer service
Appendix 13: Consultant Advisors
Satish Babu – President and Co-founder, InApp Jing Wang – Strategic Business Manager, EMC
China Joseph Abraham – Senior Vice President, HCL
America
Appendix 14: Regulation & Formats
Finance
Cross-sector
Government
Source: Watson Hall Ltd. UK Data Retention Requirements. Available at: https://www.watsonhall.com/methodology/uk-data-retention-requirements.pl (no date) (Accessed: 6 December 2010).
1. Business Insights. (2009) ‘Key Players in Enterprise Security – Strategy, performance and SWOT analysis’, Datamonitor 360 [Online]. Available at: http://360.datamonitor.com/ (Accessed: 1 November 2010).
2. Eastwood, G. (2006) ‘The IT Services opportunity in vertical sectors’, Datamonitor 360 [Online]. Available at: http://360.datamonitor.com/ (Accessed: 1 November 2010).
3. http://www.slideshare.net/rsink/gartner-report-it-spending-2010 (no date). (Accessed: 1 November 2010).
4. Mohammed, A. (2007) ‘Retention challenge’, Computer Weekly, Oct 16, 2007, pp. 26, 28. ABI/INFORM Trade & Industry [Online]. Available at: http://proquest.umi.com/pqdweb?did=1381419601 (Accessed: 1 November 2010).
5. Dr. Gishner, B. (2006) The Role of ISO 10303 (STEP) in Long Term Data Retention. Available at: http://edge.cs.drexel.edu/LTKR/Gischner%20-%20STEP%20Presentation%20on%20LTKR.pdf (Accessed: 4 November 2010).
6. Harada, R. (2003) ‘Are you prepared for long-term data preservation? - first in/first out’, Computer Technology Review, Oct 2003 [Online]. Available at: http://findarticles.com/p/articles/mi_m0BRZ/is_10_23/ai_111062977/ (Accessed: 2 November 2010).
7. Ex Libris Ltd. CASE STUDY: DIGITAL PRESERVATION AT THE NATIONAL LIBRARY OF NEW ZEALAND. Preservation: A Forward-Looking Mission. Available at: http://www.exlibrisgroup.com/files/CaseStudy/SunPreservationandNLNZ.pdf (Accessed: 29 October 2010).
8. Gantz, J.F. (2008) The Diverse and Exploding Digital Universe, IDC White Paper, March. Available at: http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf (Accessed: 3 November 2010).
9. Iron Mountain Incorporated. (2010) New Insights in Records Management Compliance. Available at: http://www.ironmountain.com/webcasts/New-Insights-in-Records-Management-Compliance.html (Accessed: 29 November 2010).
10. Caldwell, F. (2010) Magic Quadrant for Enterprise Governance, Risk and Compliance Platforms. Available at: http://www.gartner.com/technology/media-products/reprints/metricstream/vol3/article2/article2.html (Accessed: 2 November 2010).
11. InTechnology Ltd. (2004) Making Sense of Data Law, InTechnology Managed Services White Paper, April. Available at: http://www.intechnology.co.uk/documents/whitepapers/MakingSense_DataLaw.pdf (Accessed: 2 November 2010).
12. Bleicher, P. (2002) Diamonds May Be Forever, But Data? – Applied Clinical Trials. Available at: http://appliedclinicaltrialsonline.findpharma.com/appliedclinicaltrials/article/articleDetail.jsp?id=87252 (Accessed: 1 December 2010).
13. Covington & Burling LLP. (2006) White Paper on Data Retention of Voice mail for Regulated and Unregulated Industries in the U.S. and E.U. Microsoft Corporation. Available at: http://download.microsoft.com (Accessed: 30 October 2010).
14. Clark, E. (2004) ‘Data retention regulations: keeping it legal’, Network Magazine, Mar 2004, 19 (3), pp. 24-30. ABI/INFORM Trade & Industry [Online]. Available at: http://proquest.umi.com/pqdweb?did=1381419601 (Accessed: 7 November 2010).
15. Brooks, P.K. (1997). ‘Records Retention: Producing value from compliance’, ABA Bank Compliance, May/Jun 1997, 18 (5), pp. 42. ABI/INFORM Global [Online]. Available at: http://proquest.umi.com/pqdweb?did=1381419601 (Accessed: 7 November 2010).
16. NetIQ Corporation. (2006) Addressing Regulatory Compliance in the Healthcare Industry. Available at: http://download.netiq.com/CMS/WHITEPAPER/NetIQIndustryWP_HC.pdf (Accessed: 7 November 2010).
17. Furness, V. (2007). ‘The future of regulatory compliance - Strategies for a second wave of regulation’, Datamonitor 360 [Online]. Available at: http://360.datamonitor.com/ (Accessed: 1 November 2010).
18. Harris, R. (2010) ‘100 year data preservation’, ZDNet, 22 September. Available at: http://www.zdnet.com/blog/storage/100-year-data-preservation/1093?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed%3A+ZDNetBlogs+%28ZDNet+All+Blogs%29 (Accessed: 15 November 2010).
19. Crandall, W. (2008) ‘Best Practices in ECM Backup and Recovery’, Business TechEdge, August 2008 [Online]. Available at: http://techedgemag.com/content/ContentCT.asp?P=275 (Accessed: 17 November 2010).
20. MessageSolution, Inc. Compliance - Regulations Overview. Available at: http://www.messagesolution.com/resources.htm#frcp (no date) (Accessed: 17 November 2010).
21. CA Technologies. (2010) The Avoidable Cost of Downtime, CA Technology Research Report. Available at: http://www.arcserve.com/files/supportingpieces/acd_report_100908_244254.pdf (Accessed: 5 November 2010).
22. University of Miami Leonard M. Miller School of Medicine. Data retention: Selected requirements by data type – Information Technology – Miller School of Medicine at the University of Miami. Available at: http://it.med.miami.edu/x1312.xml (no date) (Accessed: 27 November 2010).
23. http://www.metricstream.com/solutions/ferc_compliance.htm (no date) (Accessed: 27 November 2010).
24. Raffo, D. (2010) Disaster recovery/business continuity planning top spending priority, according to Forrester. Available at: http://searchstorage.techtarget.co.uk/news/article/0,289142,sid181_gci1519940,00.html (Accessed: 27 November 2010).
25. Lyman, P. (2002) ‘Archiving the World Wide Web’, Building a National Strategy for Digital Preservation, April. Available at: http://www.clir.org/pubs/reports/pub106/web.html (Accessed: 8 November 2010).
26. FaceTime Communications, Inc. Instant Messaging and Social Media Regulatory Requirements. Available at: http://www3.facetime.com/solutions/regulatoryrequirements.aspx (no date) (Accessed: 27 November 2010).
27. ALSTON + BIRD LLP. What are the HIPAA rules? Available at: http://www.klsecurity.com/hipaa_regulation.htm (no date) (Accessed: 27 November 2010).
28. Moffet, B.L. (2004) Timeframes for Document Retention. Available at: http://www.gfrlaw.com/pubs/GordonPubDetail.aspx?xpST=PubDetail&pub=96 (Accessed: 3 December 2010).
29. Breaux, T.D. (2008) ‘Analyzing Regulatory Rules for Privacy and Security Requirements’, IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, VOL. 34, NO. 1, JANUARY/FEBRUARY 2008 [Online]. Available at: http://www.computer.org/portal/web/csdl/doi/10.1109/TSE.2007.70746 (Accessed: 6 December 2010).
30. Swartz, N.. (2007). ‘UK Stores Phone Data for One Year’, Information Management Journal 41, no. 6: 17. Business Source Complete, EBSCOhost [Online]. Available at: http://search.ebscohost.com/login.aspx?direct=true&db=bth&AN=27569860&site=bsi-live (Accessed: 1 November 2010).
31. Lee, J.J. (2005) ‘The Compliance Imperative: Managing Record Retention in a Rapidly Changing Regulatory Environment’, DM Review, 15(6), pp. 38-41. ABI/INFORM Trade & Industry [Online]. Available at: : http://proquest.umi.com/pqdweb?did=1381419601 (Accessed: 6 December 2010).
32. Watson Hall Ltd. UK Data Retention Requirements. Available at: https://www.watsonhall.com/methodology/uk-data-retention-requirements.pl (no date) (Accessed: 6 December 2010).
33. Preimesberger, C. (2008) ‘ON THE BRINK OF DISASTER. (cover story).’, eWeek, 25, no. 22, pp. 31-38. Business Source Complete, EBSCOhost [Online]. Available at: http://search.ebscohost.com/login.aspx?direct=true&db=bth&AN=33986023&site=bsi-live (Accessed: 1 November 2010).
34. Nettleton, E., and Watts, M. (2007) ‘Assessing the costs of data retention in the UK’, Journal of Database Marketing & Customer Strategy Management, 15, no. 1, pp. 56-59. Business Source Complete, EBSCOhost [Online]. Available at: http://search.ebscohost.com/login.aspx?direct=true&db=bth&AN=30050546&site=bsi-live (Accessed: 1 November 2010).
35. Lior, A. (2010) ‘Protecting Critical Data’, Dataquest, November 2. Factiva [Online]. Available at: http://global.factiva.com (Accessed: 6 December 2010).
36. Smith, E.M. (2009) ‘Storage management: What radiologists need to know’, Applied Radiology, Volume 38, Issue 5. Factiva [Online]. Available at: http://global.factiva.com (Accessed: 2 November 2010).
37. Mitchell, D.R. and Mitchell, J.A. (2006) ‘Status of clinical gene sequencing data reporting and associated risks for information loss’, Journal of Biomedical Informatics, 40 (2007), pp. 47-54. ScienceDirect [Online]. Available at: http://www.sciencedirect.com (Accessed: 4 November 2010).
38. Howard, C.E. (2009) ‘Secure information storage: military and aerospace equipment designers have a wealth of online options available to fill the ever-increasing need for secure data storage in the field’, Military and Aerospace Electronics, Volume 20, Issue 12. Factiva [Online]. Available at: http://global.factiva.com (Accessed: 4 November 2010).
39. Naccarati, P. (2003) Data Storage: Managing Obsolescence, Adtron Marketing, June. Available at: http://www.adtron.com/pdf/AdtronMO030606.pdf (Accessed: 3 November 2010).
40. Harris, R. and Olby, N. (2001) ‘Earth observation data archiving in the USA and Europe’, Space Policy, 17 (2001), pp. 35-48. ScienceDirect [Online]. Available at: http://www.sciencedirect.com (Accessed: 4 November 2010).
41. Scott, R.E. (2007) ‘e-Records in health – Preserving our future’, International Journal of Medical Informatics, 76 (2007), pp. 427-431. ScienceDirect [Online]. Available at: http://www.sciencedirect.com (Accessed: 4 November 2010).
42. Ball, A., Ding, L., and Patel, M. (2008) ‘An approach to accessing product data across system and software revisions’, Advanced Engineering Informatics, 22 (2008), pp. 222-235. ScienceDirect [Online]. Available at: http://www.sciencedirect.com (Accessed: 4 November 2010).