The Importance of Reference & MDM Eternal Management of the Data Mind Peter Aiken, Ph.D. Copyright 201 8 by Data Blueprint Slide # • DAMA International President 2009-2013 • DAMA International Achievement Award 2001 (with Dr. E. F. "Ted" Codd • DAMA International Community Award 2005 Peter Aiken, Ph.D. • 33+ years in data management • Repeated international recognition • Founder, Data Blueprint (datablueprint.com) • Associate Professor of IS (vcu.edu) • DAMA International (dama.org) • 10 books and dozens of articles • Experienced w/ 500+ data management practices • Multi-year immersions: – US DoD (DISA/Army/Marines/DLA) – Nokia – Deutsche Bank – Wells Fargo – Walmart – … 2 Copyright 201 8 by Data Blueprint Slide #
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
The Importance of Reference & MDM
Eternal Management of the Data Mind
Peter Aiken, Ph.D.
Copyright 2018
by Data Blueprint
Slide #
• DAMA International President 2009-2013
• DAMA International Achievement Award 2001 (with Dr. E. F. "Ted" Codd
• DAMA International Community Award 2005
Peter Aiken, Ph.D.• 33+ years in data management • Repeated international recognition • Founder, Data Blueprint (datablueprint.com) • Associate Professor of IS (vcu.edu) • DAMA International (dama.org) • 10 books and dozens of articles • Experienced w/ 500+ data
management practices • Multi-year immersions:
– US DoD (DISA/Army/Marines/DLA)– Nokia – Deutsche Bank– Wells Fargo – Walmart– …
2Copyright 2018
by Data Blueprint
Slide #
Data Assets Win!
Data Assets
Financial Assets
RealEstate Assets
Inventory Assets
Non-depletable
Available for subsequent
use
Can be used up
Can be used up
Non-degrading √ √ Can degrade
over timeCan degrade
over time
Durable Non-taxed √ √
Strategic Asset √ √ √ √
Data Assets Win!• Today, data is the most powerful, yet underutilized and poorly
managed organizational asset • Data is your
– Sole – Non-depletable – Non-degrading – Durable – Strategic
• Asset – Data is the new oil! – Data is the new (s)oil! – Data is the new bacon!
• As such, data deserves: – It's own strategy – Attention on par with similar organizational assets – Professional ministration to make up for past neglect
3Copyright 2018 by Data Blueprint Slide #
Asset: A resource controlled by the organization as a result of past events or transactions and from which future economic benefits are expected to flow [Wikipedia]
Copyright 2013 by Data Blueprint
• Data Management Overview
• What is Reference and MDM?
• Why is Reference and MDM important?
• Reference & MDM Building Blocks
• Guiding Principles & Best Practices
• Take Aways, References & Q&A
Unlocking Business Value Through Reference & Master Data Management
4
UsesUsesReuses
What is data management?
5Copyright 2018 by Data Blueprint Slide #
Sources
Data Engineering
Data Delivery
Data
Storage
Specialized Team Skills
Data Governance
Understanding the current and future data needs of an enterprise and making that data effective and efficient in supporting business activitiesAiken, P, Allen, M. D., Parker, B., Mattia, A., "Measuring Data Management's Maturity: A Community's Self-Assessment" IEEE Computer (research feature April 2007)
Data management practices connect data sources and uses in an organized and efficient manner • Engineering • Storage • Delivery • Governance
When executed, engineering, storage, and delivery implement governance
Note: does not well-depict data reuse
Data Management
6Copyright 2018 by Data Blueprint Slide #
Sources
Data Engineering
Data Delivery
Data
Storage
Specialized Team Skills
Resources
(optimized for reuse)
Data Governance
Ana
lytic
Insi
ght
Specialized Team Skills
Copyright 2018
by Data Blueprint
Slide #
Maslow's Hierarchy of Needs
7Copyright 2018
by Data Blueprint
Slide #
You can accomplish Advanced Data Practices without becoming proficient in the Foundational Data Management Practices however this will: • Take longer • Cost more • Deliver less • Present
greaterrisk(with thanks to Tom DeMarco)
Data Management Practices Hierarchy
Advanced Data
Practices • MDM • Mining • Big Data • Analytics • Warehousing • SOA
Foundational Data Management Practices
8
Copyright 2018 by Data Blueprint
Data Platform/Architecture
Data Governance Data Quality
Data Operations
Data Management Strategy
Technologies
Capabilities
Copyright 2018
by Data Blueprint
Slide #
DMM℠ Structure of 5 Integrated DM Practice Areas
Data architecture implementation
Data Governanc
e
Data Manageme
ntStrategy
Data Operations
PlatformArchitectur
e
SupportingProcesses
Maintain fit-for-purpose data, efficiently and effectively
9Copyright 2018
by Data Blueprint
Slide #
Manage data coherently
Manage data assets professionally
Data life cycle management
Organizational support
Data Quality
Copyright 2018
by Data Blueprint
Slide #
Data Management Strategy is often the weakest link
Data architecture implementation
Data Governanc
e
Data Manageme
ntStrategy
Data Operations
PlatformArchitectur
e
SupportingProcesses
Maintain fit-for-purpose data, efficiently and effectively
10Copyright 2018
by Data Blueprint
Slide #
Manage data coherently
Manage data assets professionally
Data life cycle management
Organizational support
Data Quality
3 3
33
1
Copyright 2013 by Data Blueprint
The DAMA Guide to the Data Management Body of Knowledge
11
Data Management Functions
Published by DAMA International • The professional
association for Data Managers (40 chapters worldwide)
DMBoK organized around • Primary data
management functions focused around data delivery to the organization
• Provide context for transactions • From the term "Master File"
Master Data Management Definition
15Copyright 2018 by Data Blueprint
Wikipedia: Golden Version• In software development:
– The Golden Master is usually the RTM (Released to Manufacturing) version, and therefore the commercial version. It represents the development stage of "RTM" (Released To Manufacturing), often referred to as "going gold", or "gone golden".
– Often confused with "gold master" which refers to a physical recording entity such as that sent to a manufacturing plant.
• In data management:
– It is the data value representing the "correct" answer to the business question
• Definition-Reference/Master Data Management
– Planning, implementation and control activities to ensure consistency with a "golden version" of contextual data values.
16Copyright 2018 by Data Blueprint Slide #
Definition: Reference Data Management• Control over defined domain values (also known as
vocabularies), including:
• Control over standardized terms, code values and other unique identifiers;
• Business definitions for each value, business relationships within and across domain value lists, and the;
• Consistent, shared use of accurate, timely and relevant reference data values to classify and categorize data.
17Copyright 2018 by Data Blueprint Slide #
Copyright 2013 by Data Blueprint
Reference Data
• Reference Data: – Data used to classify or categorize other data, the value
domain
– Order status: new, in progress, closed, cancelled
Control over master data values to enable consistent, shared, contextual use across systems, of the most accurate, timely and relevant version of truth about essential business entities.
19
Copyright 2013 by Data Blueprint
Master Data• Data about business entities providing context
for transactions but not limited to pre-defined values
• Business rules dictate format and allowable ranges – Parties (individuals, organizations, customers,
citizens, patients, vendors, supplies, business partners, competitors, employees, students)
Unlocking Business Value Through Reference & Master Data Management
22
Copyright 2013 by Data Blueprint
Reference Data Facts 2012
• Home-grown reference data solutions predominate, putting institutions at risk for meeting regulatory constraints
• Risk management is seen as a more important business driver for improving data quality than cost
23
Source: http://www.igate.com/22926.aspx
• Global industry-wide survey of reference data professionals
• Results show: Poor quality of reference data continues to create major problems for financial institutions.
Copyright 2013 by Data Blueprint
Reference Data Facts 2012, cont’d• Despite recommended practices of centralizing
reference data operations, 31% of the firms surveyed still manage data locally
• New and changing regulatory requirements have prompted many financial service companies to re-evaluate their reference data strategies. To prepare for new regulations, nearly 62% of survey respondents are planning to extend or customize their reference data systems during 2012 and 2013.
24
Source: http://www.igate.com/22926.aspx
Copyright 2013 by Data Blueprint
Interdependencies
25
Data Governance
Master Data Data Quality
interdependencies
26Copyright 2018 by Data Blueprint Slide #
Data Governance
Master Data Data Quality
makes the case and is
responsible for
is a necessary but insufficient prerequisite
to success
MD capabilities constrain governance
effectiveness
Solution Framework
27Copyright 2018 by Data Blueprint Slide #
SORs
SOR 1
SOR 2
SOR 3
SOR 4
SOR 5
SOR 6
SOR 7
SOR 8
Repository
IndicatorExtraction
Service (could be
segmented byday of week
month, system, etc.)
UpdateAddresses
LatencyCheckService
Ch 1
Ch 2
Ch 3
Ch 4
Ch 5
Ch 6
Channels
Ch 7
Ch 8
External Address Validation Processing
CustomerContact
Copyright 2013 by Data Blueprint
Inextricably intertwined
28
Organized Knowledge 'Data'
Improved Quality Data
Data Organization Practices
Operational Data
Data Quality Engineering
Master Data Management
Practices
Suspected/ Identified
Data Quality
Problems
Routine Data Scans
Master Data Catalogs
Routine Data Scans
Knowledge Management
Practices
Data that might benefit from Master Management
Sources( (Metadata(Governance(
(
Metadata(Engineering(
(
Metadata(Delivery( Uses(
Metadata(Prac8ces((dashed lines not in existence)
Metadata(Storage(
Copyright 2013 by Data Blueprint
Interactions
29
Improved Quality Data
Master Data
Monitoring
Data Governance
Practices
Master Data Management
Practices
Governance Violations Monitoring
Data Quality Engineering
Practices
Data Quality
Monitoring
Monitoring Results:
Suspected/ Identified
Data Quality
Problems Data Quality Rules
Monitoring Results:
Suspected/ Master Data &
Characteristics
Routine Data
Scans
Master Data
Catalogs
Governance Rules
Routine Data
Scans
Monitoring Rules
Focused Data
Scans
Operational Data
Data Harvesting
Quality Rules
Copyright 2013 by Data Blueprint
Payroll Application(3rd GL)Payroll Data
(database)
R& D Applications(researcher supported, no documentation)
R & D Data (raw) Mfg. Data
(home grown database)
Mfg. Applications(contractor supported)
Finance
Data (indexed)
Finance Application(3rd GL, batch
system, no source)
Marketing Application(4rd GL, query facilities, no reporting, very large)
Marketing Data
(external database)
Personnel App.(20 years old,
un-normalized data)
Personnel Data
(database)
30
Multiple Sources of (for example) Customer Data
Copyright 2013 by Data Blueprint
Vocabulary is Important-Tank, Tanks, Tankers, Tanked
• Understand Reference and Master Data Integration Needs
• Identify Master and Reference Data Sources and Contributors
• Define and Maintain the Data Integration Architecture
• Implement Reference and Master Data Management Solutions
• Define and Maintain Match Rules • Establish “Golden” Records • Define and Maintain Hierarchies and Affiliations • Plan and Implement Integration of New Data Sources • Replicate and Distribute Reference and Master Data • Manage Changes to Reference and Master Data
Data Requirements • Data Models and Documentation • Reliable Reference and Master Data • "Golden Record" Data Lineage • Data Quality Metrics and Reports
Consumers: • Application Users • BI and Reporting Users • Application Developers and
Architects • Data integration Developers and
Architects • BI Vendors and Architects • Vendors, Customers and Partners
Participants: • Data Stewards • Subject Matter Experts • Data Architects • Data Analysts • Application Architects • Data Governance Council • Data Providers • Other IT Professionals
Suppliers: • Steering Committees • Business Data Stewards • Subject Matter Experts • Data Consumers • Standards Organizations • Data Providers
Applications • Data Modeling Tools • Process Modeling Tools • Meta-data Repositories • Data Profiling Tools • Data Cleansing Tools • Data Integration Tools • Business Process and Rule Engines • Change Management Tools
Copyright 2013 by Data Blueprint
• Data Management Overview
• What is Reference and MDM?
• Why is Reference and MDM important?
• Reference & MDM Building Blocks
• Guiding Principles & Best Practices
• Take Aways, References & Q&A
Unlocking Business Value Through Reference & Master Data Management
49
Copyright 2013 by Data Blueprint
Guiding Principles1. Shared R/M data belong to
the organization. 2. R/M data management is an
on-going data quality improve-ment program – goals cannot be achieved by 1 project alone.
3. Business data stewards are the authorities accountable at determining the golden values.
4. Golden values represent the "best" sources. 5. Replicate master data values only from golden
sources. 6. Reference data changes require formal change
March Webinar: The Importance of MDMMarch 13, 2018 @ 2:00 PM ET/11:00 AM PT
April Webinar: Data Modeling FundamentalsApril 10, 2018 @ 2:00 PM ET/11:00 AM PT Sign up at: www.datablueprint.com/webinar-schedule Enterprise Data World 2018 (San Diego) The First Year as a CDOApril 24, 2018 @ 1:30 PM ET
Upcoming Events
58Copyright 2018 by Data Blueprint Slide #
Brought to you by:
Copyright 2013 by Data Blueprint
Questions?
59
It’s your turn!
Use the chat feature or Twitter (#dataed) to submit