From Print to the Cloud and Beyond: The Story of a Century Old Company and its Resiliency to Ever-Evolve
Post on 16-Jul-2015
105 Views
Preview:
Transcript
From Print to the Cloud and Beyond
The Story of a Century Old Company and its Resiliency to Ever-Evolve
Agenda
CAS Overview
CAS - In the Beginning… There was Print
CAS - The Age of Silos
CAS - IBM Integration. To the Cloud… and Beyond
Future Considerations
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.2
Agenda
CAS Overview
CAS - In the Beginning… There was Print
CAS - The Age of Silos
CAS - IBM Integration. To the Cloud… and Beyond
Future Considerations
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.3
CAS helps scientists around the world benefit from the published
work of their colleagues by monitoring, abstracting and indexing the
world's chemistry-related literature
CAS has been supporting scientists for more than 100 years
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.4
Since 1907, CAS’s objective
has been to find, collect, and
organize all publicly disclosed
chemistry substance
information
CAS helps scientists around the world benefit from the published work of their colleagues
CAplusSM
CAS REGISTRYSM
CHEMLIST®
CIN®
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.5
Markush
Indexing
Authority
Processing
Source
Selection
Document
Indexing
Reaction
Indexing
MARPAT®
CHEMCATS®
CAS scientists monitor, abstract and index the world's chemistry-
related literature
Proprietary, standardized indexing in CAS databases ensures
consistent, comprehensive search results.
CASREACT®
CAS products and services make it faster and easier for scientist to find the information they need for their research
CAS Registry Numbers® uniquely identify each
chemical substance without the ambiguity of multiple
naming conventions
STN® combines industry-leading search and retrieval
with unique and comprehensive content
SciFinder® offers a one-stop shop experience with
flexible search and discover options based on user
input and workflow
Science IP®, the CAS information search service
provides fast, comprehensive and accurate searches
of the world’s scientific and technical literatureCAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.6
CAS Registry Number 58-08-2
CAFFEINE!
Agenda
CAS Overview
CAS - In the Beginning… There was Print
CAS - The Age of Silos
CAS - IBM Integration. To the Cloud… and Beyond
Future Considerations
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.7
CAS Timeline108 Years of Progress (and Counting)
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.8
CAS End-To-End Architecture“In the Beginning… There was Print”
Data
Transformation
Data Validation
Data CurationData Integration
Data Presentation
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.9
Data Ingestion
“CAS Knows Jack”Jack and Friends Beside Printed Chemical Abstracts
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.10
Agenda
CAS Overview
CAS - In the Beginning… There was Print
CAS - The Age of Silos
CAS - IBM Integration. To the Cloud… and Beyond
Future Considerations
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.11
Data Ingestion Data Transformation Data Validation Data Normalization Data Persistence
CAS End-To-End Architecture“The Age of Silos”
Data Ingestion Data Transformation Data Validation Data Curation Data Integration Data Persistence
Data Transformation Data Validation Data Integration Data Presentation
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.12
Silo Challenges
Multiple Data Ingestion Points– In some cases, the same data is being ingested twice
Multiple Views of the Data– Each silo must perform complex transformations to its specific view
– Editorial manufactures normalized data based on a print model
– Product Development wants de-normalized, complete data
– Content Delivery has a mixed view of the data
Multiple Vocabulary Conventions– Differing data definitions causes confusion across silos
No Unified, Authority Data Store– Each silo has their own copy of the data in its own specific vocabulary
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.13
Editorial Legacy Systems
Many disparate databases used to store relational data– Becomes difficult to maintain and support
Multiple database technologies used– No unified platform
Challenges to support legacy systems– Some legacy technologies are no longer supported
– Succession planning difficult to support legacy systems
– Special IT used so that legacy code would not need to be touched
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.14
Content Delivery Systems
Data was transformed into one common data model to bridge
gap between Editorial and Product View– One common schema model was complex and unwieldy
– Common model contained “unnecessary” complexities
– Common model did not align with Product Development’s specifications
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.15
Product Development Systems
Product Development must code for “unnecessary” complexities
Data not completely de-normalized– Additional development necessary to compile data
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.16
Silo Challenges
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.17
By the Numbers
Thousands of journals ingested per day– Approximately 1 TB of data per week
Over 100 other data feeds ingested per day
Over 1.2 million messages processed per day– Synced up with product data daily in less than 10 minutes
Over 6 TB of compiled data created per day
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.18
What is an Architect to Do?
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.19
Unify…Integrate…Simplify
Unify Data; Processes; Transformations; Data Ingestion
Integrate Disparate Systems; Services; Applications; and
Data Consumers
Simplify the Architecture!!!
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.20
• Run proof-of-concept and/or proof-of-technology and/or pilot project as needed
• Negotiate contract
• Adjust as needed
• Selection team members score vendor solutions
• Aggregate scores
• Select vendor with best aggregate score (judgement required)
• Bake-off if winner is too close to call
• Send RFP document to prospective vendors
• Hold clarification meetings with vendor teams
• Vendors send RFP response documents
• Vendors present their solutions and answer questions
• Create technology selection team
• Identify key requirements (based on architecture and tech stack governance)
• Assign weights
• Create RFP document and scorecard spreadsheet
Request For Proposal
Create RFP
Engage vendors
Score-driven selection
Validate selection
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.
Requirements
Data Integration
Durable Message Bus with Guaranteed Delivery
Any-to-Any Connectivity
Architectural Flexibility
Excellent Support
A Proven Solution
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.22
Agenda
CAS Overview
CAS - In the Beginning… There was Print
CAS - The Age of Silos
CAS - IBM Integration. To the Cloud… and Beyond
Future Considerations
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.23
Unify…Integrate…Simplify
Data Curation
Data Ingestion Data Transformation Data Validation Data Normalization Data Integration
Data Transformation Data Validation Data Integration Data Presentation
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.24
Data Persistence Data Flow Orchestration
Agenda
Overview
CAS - In the Beginning… There was Print
CAS - The Age of Silos
CAS - IBM Integration. To the Cloud… and Beyond
Future Considerations
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.25
To the Cloud… and Beyond!
Off-Prem Processing Bursting Capabilities Data Center Relief Co-Location Capabilities
New Mobile Applications
Service Unification Service Management Service Integration
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.26
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.27
Questions
CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.28
Connect with CAS:
Joseph Sapp
Lead Enterprise Application Architect
jsapp@cas.org
www.linkedin.com/in/joesapp
top related