Migration from SharePoint to Alfresco
Migration Best Practices
Agenda
• Technology Services Group Overview
• Migration Overview
• Case Studies
  – Orbitz Worldwide
  – United Cerebral Palsy
  – Web Content Migrations
• OpenMigrate Architecture Overview
TSG Background
• Consulting and Software Firm
  – Founded in 1996
  – Based in Downtown Chicago
  – Specialize in the implementation of ECM solutions and technologies
  – Diverse industries: Financial Services, Manufacturing, Insurance, Food & Beverage, Travel, Associations, Pharmaceutical
• Alfresco Gold Partner since early 2006
  – Document Management
  – Web Content Management
Migration Overview
• Reasons for Migrations
  – Switching vendors
  – New platform infrastructure
  – Mergers, acquisitions, demergers
  – Consolidation
  – Bulk processing (scanning physical content)
• Migration Frequency
  – One time activity
  – Repeatable process
Every Migration is Different
Migration Challenges
• Ensure specific business rules are applied to the migrated data
• Migrate documents from different data sources
• Reduce complexity and support expense by having a common tool for each data source
• Minimize time and management effort to set up large migrations
• Decrease time to migrate documents
• Track migration progress (including audit)
• Reduce effort involved in recovering from failed migrations
Introducing OpenMigrate
• An open source framework made up of highly configurable and extensible migration components
• A proven methodology developed to assist our clients in the migration of their content between content locations
Flexible Framework
OpenMigrate Architecture
• Open Source Framework
• Configuration
• High Volume Capable
• Platform Agnostic
  – Source/Target/OS/Database independent
This architecture reduces code and improves application quality, flexibility, testability and scalability
Orbitz Worldwide Case Study
• Orbitz Worldwide is a leading global online travel company allowing customers to plan and book a broad range of travel products.
• Consumer brands include Orbitz, CheapTickets, ebookers, HotelClub, RatesToGo, the AwayNetwork, Asia hotels, and Orbitz for Business.
Orbitz Worldwide Case Study: The Setup
Orbitz Worldwide Case Study: Business Concerns
• Define migration plan before cut-off date
  – How to migrate content?
  – Who owns the content?
  – What needs to be migrated?
• Maintain business continuity
  – Minimize downtime during migration
  – Provide effective document management solution
  – Train users
Orbitz Worldwide Case Study: Technical Concerns
• Source
  – Interface with SharePoint
  – Content clean-up
  – Large amount of data (approx. 700 sites/subsites)
• Content Migration Rules
  – Versions
  – Attributes
  – Security
  – Folders
Orbitz Worldwide Case Study: Before the Migration
Orbitz Worldwide Case Study: The Migration
• Using OpenMigrate
  – Core Framework
  – Database Adapter for Source
    • SharePoint Database schema
  – Alfresco Adapter for Target
Orbitz Worldwide Case Study: The Source
Orbitz Worldwide Case Study: The Target
• Alfresco adapter
  – Webscripts to interface with Alfresco
  – Target calls the webscripts
• Features
  – Create spaces (when needed)
  – Set object and aspects
  – Set content
  – Set metadata
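The target-side features listed above can be pictured as one request per document sent to a web script. The following is only a sketch of how such a request might be assembled: the script path `/alfresco/service/migration/document`, the JSON field names, and the `createSpaces` flag are all hypothetical, since the slides do not describe the actual web scripts. No network call is made.

```python
import json
from urllib.parse import quote

# Hypothetical web script path: the slides do not name the actual scripts.
WEBSCRIPT_PATH = "/alfresco/service/migration/document"


def build_import_request(host, space_path, name, object_type, aspects, metadata):
    """Return the URL and JSON body for one import call (no network I/O here)."""
    url = f"http://{host}{WEBSCRIPT_PATH}?path={quote(space_path)}"
    body = {
        "name": name,
        "type": object_type,            # set object type
        "aspects": list(aspects),       # set aspects
        "properties": dict(metadata),   # set metadata
        "createSpaces": True,           # hypothetical flag: create missing spaces
    }
    return url, json.dumps(body, sort_keys=True)


url, body = build_import_request(
    host="localhost:8080",
    space_path="/Company Home/Finance/2009",
    name="budget.xls",
    object_type="cm:content",
    aspects=["cm:titled"],
    metadata={"cm:title": "Budget"},
)
```

The content itself would travel separately (for example as a file upload), which this sketch leaves out.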
Orbitz Worldwide Case Study: The Target
United Cerebral Palsy Case Study
• United Cerebral Palsy (UCP) is the leading source of information on cerebral palsy and an advocate for the rights of persons with any disability.
• The mission of United Cerebral Palsy is to advance the independence, productivity, and full citizenship of people with disabilities through an affiliate network.
UCP Case Study
• Project to distribute digital copies of books to disabled school children
  – Different formats (audio, PDF, auto-readers)
  – Collaboration among school districts to share learning material
• Live since September 2007
  – First system in nation to provide this service
UCP Case Study: The Application
• Search and Retrieval Application
  – Allows administrators to upload content
• .NET code base using Service Oriented Architecture (SOA) to retrieve from Repository
UCP Case Study: The Setup
UCP Case Study: Technical Concerns
• Source
  – Understand the source structure
  – Access to the metadata and files
  – Application is abstracted from content
• Content Migration Rules
  – Attribute Mapping
  – Maintaining relationships
UCP Case Study: The Source
UCP Case Study: The Target
• First migrate all “book” objects as spaces
  – Set object type
  – Set metadata fields
• Migrate chapters
  – Map the file location based on relationship
  – Set object type
  – Set metadata
  – Set the content
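The ordering above (books first, then chapters placed by relationship) can be sketched as a two-pass routine. This is an illustration only: the record fields (`id`, `book_id`, `isbn`), the `/Books/...` path layout, and the in-memory `target` dictionary standing in for the repository are all assumptions.

```python
def migrate_books_then_chapters(books, chapters):
    """Two-pass sketch: create book spaces, then file chapters under them."""
    target = {}       # space path -> space record (stands in for the repository)
    book_paths = {}   # source book id -> target space path

    # Pass 1: every book becomes a space with its type and metadata set.
    for book in books:
        path = f"/Books/{book['title']}"
        target[path] = {"type": "book", "metadata": {"isbn": book["isbn"]}, "chapters": []}
        book_paths[book["id"]] = path

    # Pass 2: each chapter is filed under the space of the book it relates to.
    orphans = []
    for chapter in chapters:
        parent = book_paths.get(chapter["book_id"])
        if parent is None:
            orphans.append(chapter["id"])   # relationship could not be resolved
            continue
        target[parent]["chapters"].append(
            {"type": "chapter", "name": chapter["name"], "content": chapter["content"]}
        )
    return target, orphans


books = [{"id": 1, "title": "Algebra", "isbn": "000-0"}]
chapters = [
    {"id": 10, "book_id": 1, "name": "ch1.pdf", "content": b"..."},
    {"id": 11, "book_id": 2, "name": "ch1.mp3", "content": b"..."},
]
target, orphans = migrate_books_then_chapters(books, chapters)
```

Running the books first means every chapter has a parent space to land in; chapters whose book is missing are reported rather than silently dropped.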
Web Content Migrations: Business Concerns
• Content stored in a combination of databases and HTML files
  – Hard to maintain
  – Format is inconsistent
  – No approval process
  – Pages are static
• Typically part of a site redesign
Web Content Migration: Before the Migration
Web Content Migration: The Migration
• Migration of Content
  – Create the XML files to fit the standards
    • Spreadsheets
    • Database queries/Java utilities
    • “by hand”
  – Validated XML against XSD
  – Imported content and ran extraction
• No metadata imported separately
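As an illustration of the spreadsheet route above, the sketch below turns spreadsheet rows (exported as CSV) into one XML document per page. The `page` element, the column names, and the `REQUIRED` field list are hypothetical. The required-field check is only a stand-in for XSD validation, which the Python standard library does not provide; a real run would validate against the actual schema with a dedicated tool.

```python
import csv
import io
import xml.etree.ElementTree as ET

REQUIRED = ("title", "body")   # hypothetical required fields, not a real XSD


def row_to_xml(row):
    """Build one XML document from a spreadsheet row."""
    missing = [name for name in REQUIRED if not row.get(name)]
    if missing:
        raise ValueError(f"missing fields: {missing}")
    page = ET.Element("page")
    for column, value in row.items():
        ET.SubElement(page, column).text = value
    return ET.tostring(page, encoding="unicode")


# A two-line CSV export stands in for the spreadsheet.
sheet = io.StringIO("title,body\nAbout Us,Founded in 1996\n")
xml_docs = [row_to_xml(row) for row in csv.DictReader(sheet)]
```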
Background and Technical Overview
Alfresco Importing
• CIFS / WebDAV / FTP
  – Good for migration with minimal business rules
• “ZIP” file import
  – Good if users are uploading from the Alfresco interface
• ACP files
  – Good if you are going from Alfresco to Alfresco
OpenMigrate Features
• Supports Standard Migration Requirements
  – Document type
  – Attribute name and value mappings
  – Destination location
  – Content/Rendition
  – Version tree structure
• Because most migrations are slightly different, it provides a flexible framework to easily extend the existing components to handle requirements specific to a migration
  – For example
    • Initiate a workflow
    • Transform attributes
    • Leverage external data sources (e.g., lookup tables)
    • Add/Extend custom adapters (e.g., FileNet, Hummingbird)
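Attribute name and value mapping with a lookup table, as listed above, might look like the following. This is a sketch and not OpenMigrate's configuration format: the mapping tables, the `acme:` namespace, and the decision to drop unmapped attributes are all assumptions made for illustration.

```python
# Hypothetical mapping tables; a real migration would load these from configuration.
NAME_MAP = {"Title": "cm:title", "Dept": "acme:department"}
VALUE_LOOKUP = {"acme:department": {"FIN": "Finance", "HR": "Human Resources"}}


def map_attributes(source_attrs):
    """Rename source attributes and translate coded values via lookup tables."""
    mapped = {}
    for name, value in source_attrs.items():
        target_name = NAME_MAP.get(name)
        if target_name is None:
            continue   # unmapped attributes are dropped in this sketch
        lookup = VALUE_LOOKUP.get(target_name, {})
        mapped[target_name] = lookup.get(value, value)   # fall back to the raw value
    return mapped


mapped = map_attributes({"Title": "Q1 Report", "Dept": "FIN", "Legacy": "x"})
```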
Flexible Framework
• Major Migration Components
  – Migration Engine
  – Source/Target Interface
  – Source/Target Adapter (e.g., Alfresco, File system, JDBC)
  – Migration Queue/Log
• Migration Engine
  – Controls flow of data from source to target
  – Leverages configuration files to apply transformations/mappings
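The engine's role can be sketched as a loop that pulls each item from the source, applies the configured transformations in order, and hands the result to the target. The function names and the list of callables standing in for configuration are hypothetical; this is not OpenMigrate's actual engine.

```python
def run_engine(source_items, transforms, write):
    """Move each item from source to target, applying transforms in order."""
    outcomes = []
    for original in source_items:
        item = dict(original)   # work on a copy so the source record is untouched
        try:
            for transform in transforms:
                item = transform(item)
            write(item)
            outcomes.append((original["id"], "success"))
        except Exception as exc:
            outcomes.append((original["id"], f"failure: {exc}"))
    return outcomes


def rename_title(item):
    item["cm:title"] = item.pop("Title")   # raises KeyError when Title is missing
    return item


def uppercase_name(item):
    item["name"] = item["name"].upper()
    return item


# In a real setup this list would come from configuration files.
CONFIGURED_TRANSFORMS = [rename_title, uppercase_name]

written = []
outcomes = run_engine(
    [{"id": 1, "name": "a.doc", "Title": "A"}, {"id": 2, "name": "b.doc"}],
    CONFIGURED_TRANSFORMS,
    written.append,
)
```

A failure on one item is recorded and the loop continues, so one bad document does not stop the run.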
Flexible Framework
• Source/Target Interface
  – Allows the migration engine to communicate with the source/target without understanding the specifics of the source/target implementation
  – Leverages migration structure definition to know source/target
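One way to picture the interface layer is as a pair of abstract classes that the engine codes against. The sketch below uses Python abstract base classes; the class and method names (`Source.extract`, `Target.populate`) are hypothetical and chosen only to echo the "extract/populate" wording in the slides.

```python
from abc import ABC, abstractmethod


class Source(ABC):
    @abstractmethod
    def extract(self, object_id):
        """Return a generic node (dict) for the given source object."""


class Target(ABC):
    @abstractmethod
    def populate(self, node):
        """Store a generic node and return its target identifier."""


class InMemorySource(Source):
    def __init__(self, docs):
        self.docs = docs

    def extract(self, object_id):
        return {"id": object_id, "content": self.docs[object_id]}


class InMemoryTarget(Target):
    def __init__(self):
        self.stored = {}

    def populate(self, node):
        self.stored[node["id"]] = node["content"]
        return node["id"]


def migrate(source: Source, target: Target, object_ids):
    # The engine only sees the two interfaces, never a concrete system.
    return [target.populate(source.extract(object_id)) for object_id in object_ids]


target = InMemoryTarget()
migrated = migrate(InMemorySource({"a": b"1", "b": b"2"}), target, ["a", "b"])
```

Swapping in a different repository then means writing a new subclass, with no change to `migrate`.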
Flexible Framework
• Source/Target Adapter
  – Specific source/target implementations (e.g., Documentum, File system, XML, Excel, JDBC)
  – Leverages configuration files to manipulate data so the migration engine can process it
  – Leverages migration queue to know what to extract/populate
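As a concrete picture of an adapter, the sketch below reads files from a folder and normalizes each one into a generic node that an engine could process. The class name, the node fields, and the use of relative paths as queue entries are assumptions; a temporary directory stands in for the real source.

```python
import tempfile
from pathlib import Path


class FileSystemSourceAdapter:
    """Hypothetical adapter: turns files under a root folder into generic nodes."""

    def __init__(self, root):
        self.root = Path(root)

    def list_ids(self):
        # Candidate entries for the migration queue: one relative path per file.
        return sorted(
            p.relative_to(self.root).as_posix()
            for p in self.root.rglob("*") if p.is_file()
        )

    def extract(self, object_id):
        # Normalize a file into the structure the engine expects.
        path = self.root / object_id
        return {
            "id": object_id,
            "name": path.name,
            "folder": Path(object_id).parent.as_posix(),
            "content": path.read_bytes(),
        }


with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "hr").mkdir()
    (Path(tmp) / "hr" / "policy.txt").write_text("v1")
    adapter = FileSystemSourceAdapter(tmp)
    queue = adapter.list_ids()
    nodes = [adapter.extract(object_id) for object_id in queue]
```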
Flexible Framework
• Migration Queue
  – Migration “To Do” list
  – Informs source/target what to extract/populate
  – Tracks status (e.g., success or failure)
• Migration Log
  – Migration “Audit” Trail
  – Provides detailed mapping from source to target for every object
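The queue and log described above can be sketched as a small in-memory structure: a status per source object plus an append-only list of source-to-target mappings. The class, method, and field names are hypothetical, and a real implementation would presumably persist this state (for example in a database) so a failed run can be resumed.

```python
class MigrationQueue:
    """Sketch of a migration "to do" list with an audit trail."""

    def __init__(self, source_ids):
        self.status = {source_id: "pending" for source_id in source_ids}
        self.log = []   # audit trail: one entry per attempt

    def to_do(self):
        # Everything not yet migrated successfully, including earlier failures.
        return [sid for sid, state in self.status.items() if state != "success"]

    def record(self, source_id, target_id=None, error=None):
        self.status[source_id] = "failure" if error else "success"
        self.log.append({"source": source_id, "target": target_id, "error": error})


queue = MigrationQueue(["doc-1", "doc-2"])
queue.record("doc-1", target_id="/Company Home/doc-1")
queue.record("doc-2", error="source unreachable")
retry = queue.to_do()
```

Because failed items stay on the list, recovering from a failed run amounts to processing `to_do()` again.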
Flexible Framework
Architecture
• Open Source Framework
  – Java
  – Spring - a layered Java/J2EE application framework
• Configuration
  – Streamlined XML
  – Simplified properties files
• High Volume Capable
  – Multi-threaded
  – Flexible configuration can impact performance
• Platform Agnostic
  – Source/Target/OS/Database independent
This architecture reduces code and improves application quality, flexibility, testability and scalability
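To illustrate the multi-threaded point, the sketch below fans documents out to a configurable number of worker threads. OpenMigrate itself is Java/Spring; this Python version only shows the general idea, and `migrate_one`, the `/target/...` paths, and the `workers` setting are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def migrate_one(doc_id):
    # Placeholder for extract, transform, and populate of a single document.
    return doc_id, f"/target/{doc_id}"


def migrate_all(doc_ids, workers=4):
    """Migrate documents concurrently; `workers` is the tunable thread count."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order, so results line up with doc_ids.
        return dict(pool.map(migrate_one, doc_ids))


results = migrate_all(["a", "b", "c"], workers=2)
```

The thread count is the kind of configuration value that affects throughput, since the source and target systems limit how much parallel load they can absorb.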
OpenMigrate Interface
• OpenMigrate Admin
  – On the fly configuration
• Command Line
• Custom Web Application
  – Batch uploading of content, metadata
  – Import spreadsheet
OpenMigrate Administration Interface Demonstration
TSG Open Source Solutions
• OpenMigrate
  – High Performance Migration tool for one-time or ongoing migrations
• OpenContent
  – Web Services architecture supporting multiple repositories
  – .NET and Java
• OpenOverlay
  – PDF manipulation tool to allow for real-time watermarking
• Active Wizard
  – Dynamic Form and Workflow Application
The TSG Difference
• Over 13 years of experience implementing Enterprise Content Management Solutions
• 400+ ECM projects
• Focus on long term client relationships
  – Understanding the business needs
  – Providing consistent high quality solutions
• Carefully chosen Vendor Relationships
• Flexible Business Model
  – Phased Engagements
  – Cost Effective Solutions
• Contributor to the Open Source community
Download OpenMigrate
http://www.tsgrp.com/Open_Source/OpenMigrate/open-migrate.jsp
Contact
Drop us a note
Ray Wijangco
312-372-7777 x269
Naveen Aggarwal
312-372-7777 x351
Todd Pierzina
312-372-7777 x254
Questions???