8/11/2019 Ipaw113 Erdem.ppt http://slidepdf.com/reader/full/ipaw113-erdemppt 1/40 Data Warehouse Governance Simplification with Informatic Yapi Kredi Bank Presented by Ahmet Vefa Erdem 14.05.2014
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 1/40
Data Warehouse GovernanceSimplification with Informatic
Yapi Kredi Bank
Presented by Ahmet Vefa Erdem
14.05.2014
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 2/40
Agenda
About Yapı Kredi Bank
Business Problem/ IT Challenge
Selecting Data Integration Technology
Solutions in Yapı Kredi
Future Plans and Vision
2- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 3/40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 4/40
Agenda
About Yapı Kredi
Business Problem/ IT Challenge
Selecting Data Integration Technology
Solutions in Yapı Kredi
Future Plans and Vision
4- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 5/40
New Core
Banking
System
2010
Re
for
Br
2
Campaign
Mangment
2001
Data
Mining,Customer
Segment
2002
DWH
DWH
1999
DW
Transform
Project
2003
Merger
between
YapıKredi
and
KocBank
2006
DW
Assessmnt
Project
2011
Developing Data Warehouse in YapıKredi Continous improvement in BI and Data Warehouse at YapiKre
5- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 6/40
Big Changes are Big Challenges for Data Wareho
New Core Banking System
New Credit Card System
New Credit Risk Underwriting
New Treasury System
Bank Merger
(Koçbank & YapıKredi )
New Collection System
6- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 7/40
Data Warehouse Life Story in 15 years
Core
BankingCreditCard ATM Others
Internet
Banking
Data Warehouse
Reporting
ProfitabilityDMart
MIS
Profitability
Business
Intelligence
ODS
ETL
Merchant
Reporting
Credit Card
DMarts
FraudDMart
Cube
CreditCard
Cartography
CreditCard
Branch
Reporting
Operations
Dashboard(Opmis)
Branch
Reporting
Center
Campaign
DMart
Campaign
Management
(Chordiant)
CRMDMart
Potential
Customer
Management
Individual
Banking CRM
CRMPrivate Banking
CRM
Corpotate
Commercial
Banking CRM
Opportun
Manageme
Presentation DataMartsETL
7- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 8/40
Dmarts
(9TB)
ELTELT
Source1
Source2
Source..
Extraction
Enterprise BI, Reporting
Db Link
Replication
DWH
(75TB)ODS
(20TB)
Sizing and Usage Load Figures of Data Warehous
#of users
5,000#of tables
17,000#of
query/day
27,000
#of ETL
jobs
15,000
Size of
accumulated
data/day
8TB
#of
processed
rows/day
90 billions
#of
query/day
90 millions
8- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 9/40
• Lower user satisfaction
• Lower performance
• High development & maintenance
• Poor Integration capability
• Poor scalability
• Compatibility problems with 3rd p
• Data quality issues
Problems knocked on the door for Data Warehou
9- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 10/40
Diagnose & Cure:
2011 : Data Warehouse Assessment Study
• Created 5 years prog
Data Warehouse and
Assessment study findings are about;
• Data & process governance
• Data integration ( ETL )
• Data model
• Architecture
• Data quality
• Metadata management
10- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 11/40
Agenda
About Yapı Kredi
Business Problem/ IT Challenge
Selecting Data Integration Technology
Solutions in Yapı Kredi
Future Plans and Vision
11- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 12/40
GOAL :
• Select an integration and data ma
tool for data warehouse and its ec
to solve major data management
adressed at the assessment stud
• Then integrate and extend it into tdomains to reinforce enterprise d
integration and governance polici
Selecting Data Integration Technology
12- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 13/40
• We evaluated worldwide leading Technol
Integration and Data Management Soluti
• Informatica
• IBM Data Stage
• Oracle ODI
• Abinitio
• SAP Business Object Data Services
• Evaluation team was established from ;
• DW Development Team• DW Administation Team
• DW ETL Operation Team
• Enterprise Data Architecture Team
Selecting Data Integration Technology
13- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 14/40
• Functional Requirements were;• Data Integration Tool ( ETL )
• Metadata Management
• Data Archiving and Data Federation
• Test Data Management ( TDM )
• Data Profiling and Data Quality
• Big Data Integration Capability
• Intregration and compatibility require
• Compatibility with Sybase IQ and O• Ability to integrate with Power Desig
Modelling Tool
• Ability to make data lineage analysi
table to Business Objects Reports
Selecting Data Integration Technology
14- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 15/40
After 6 months evaluation w
Selecting Data Integration Technology
15- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 16/40
Using Informatica
Metadata
ManagerMetadata Manager
Data ArchivingD
Data Subset D
Governance
Information Lifecycle
Management
Test Data Management
Power CenterData Integration(ETL)
16- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 17/40
Agenda
About Yapı Kredi
Business Problem/ IT Challenge
Selecting Data Integration Technology
Solutions in Yapı Kredi
Future Plans and Vision
17- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 18/40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 19/40
Architecture
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 20/40
ArchitectureInformatica Components in Data Warehouse
ETLETLODS
Source1
Source2
Sourcen
DWH
Enterprise BI, Reporting
Informatica Power Exchange
Archives
Informatica Metadata Manager, Informatica Data Services
TestData
DMart
DMart
DMart
Extraction
Db Link
Replication
Metadata
Manager
20- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 21/40
ETL DevelopmentDefining Methods and Standarts
• Consultancy for ETL Development Best Practices from KO
• ETL development lifecycle standarts
• Documentation standarts
• Deployment methods and standarts
• Security and administration standarts
• Development best practices
• Performance tuning tips
* is the distributor and leader solution provider of Informatica
21- 40
ETL Development
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 22/40
ETL DevelopmentProductivity with Informatica Power Center
• We have reached a certain level of ETL development experiences
• Number of Project : 20
• Number of Developer : 55
• Number of Project Team :4
• Number of ETL Mapping : 1958
• Number of Workflow : 1428
22- 40
ETL D l t B fit f U i I f ti
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 23/40
Declarative Design
Define How: Built-in Templates
DefineWhat
YouWant
AutomaticallyGenerateDataflow
1 2
Conventional Hand coded ETL• Must define every step of Complex ETL Flow
• Requires specialized ETL skills• Significant development and maintenance e
Conventional ETL Design
ETL Development - Benefits from Using Informatica
Declarative Set-based Design with Info• Simplifies ETL development process
• Significantly reduce the learning curve
• Shorter implementation duration
• Enforcement of best practices and standard
• Provides Data lineage and impact analysis
• Doesn’t require specialized low level progra
23- 40
ETL D l t B fit f U i I f ti
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 24/40
Conventional Hand coded ETL• Difficult to organize of a large number o• Developer was not aware of a similar fu
already• There was an aversion to change and tdata sets and jobs are added rather tharefactoring the existing jobs
ETL Development - Benefits from Using Informatica
Declarative Set-based Design with Info
• Automatically generates the Data Flow
sources and target DB
• Reduce maintenance efforts
• Provides full functionality of ETL Autom
ILM, TDM, MM )
24- 40
ETL Development Benefits from Using Informatica
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 25/40
Source
TableFTP Load Transform Validate
Informatica
Mapping
Old ETL Jobs per Table
New ETL Job
per Table
Extract
• Decrease in UC4 jobs
• 4,160 Jobs were deleted in UC4 job scheduler ( 1040 ODS tables
• increase operational efficiency
Job Reduction
ETL Development - Benefits from Using Informatica
25- 40
Test Data Management with Informatica TDM
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 26/40
Test Data Management with Informatica TDMBefore Informatica
Db1
Db2
Db..n
Db1
Db2
Db..n
Operational
Systems (Live)
Operational
( Test & De
DWH
Live
Admin for Test Data
Preparation
DWH
Test &
Dev
IBM Optim was
used as
Enterprise Test
Management Tool
But it could not be used
for DW because of
incompatibility with
Sybase IQ
For DWH, database
administrators were
preparing test data26- 40
Test Data Management with Informatica TDM
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 27/40
est ata a age e t t o at caAfter Informatica ( for DWH Test Management)
Db1
Db2
Db..n
Db1
Db2
Db..n
Operational
Systems (Live)
Operational
( Test & De
DWH
Live
DWH
Test
DWH
Dev
Test Data Subset
Persistent Data
Masking
Informatica TDM was selected by
Data Warehouse Team because of its
rich functionality, high performance
and high compatibility with Sybase IQ
27- 40
Test Data Management with Informatica TDM
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 28/40
gAfter Informatica ( for Enterprise )
Db1
Db2
Db..n
Operational
Systems (Live)
Db1
Db2
Db..n
Operational
( Test & Dev
DWH
Live
DWH
Test
DWH
Dev
Test Data Subset
Persistent Data
Masking
Replaced IBM Optim with Informatica TDM.
Informatica TDM became «Enterprise Test Data
Management Solution» for Yapı Kredi
28- 40
Using Informatica Data Archiving and Data Virtua
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 29/40
Using Informatica Data Archiving and Data Virtua
• In last 3 years, each year d
extended between %25-40
Warehouse
• High cost, because of usinexpensive storage systems
Warehouse.
• We decided to use data ar
under control size and cost
• We archived 10TB historica
DW data in 2013
• Now, users can access the
using Informatica DataVirtu
• Planing ~10TB archive in 2
RDBMS Compressed
Informatica
Archive File
29- 40
Informatica Metadata Manager in Enterprise Arc
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 30/40
o at ca etadata a age te p se c
Db1
Db2
Db..n
Operational
Systems
Source Systems
Metadata
DataWarehouse
Metadata
DWH
ODS
DMarts
End-to-end impact analysis
MetadataManager
MetadataManager
30K
Tables
17K
Tables
30- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 31/40
Vi i
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 32/40
Vision
• Having efficient, flexible, accurat
Data Warehouse solution
• Using Big Data technologies and
performance analytics methods i
• Empower and generalize Data G
policies into the company by usin
• Use Informatica platform as ente
integration and data managemen
32- 40
Plan :Evolve Data Warehouse
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 33/40
Actions 2011 2012 2013 2014
Make
Assessment
AdaptOrganizatio
n
Modernize
Technology
& Methods
DevelopSystems
Govern
Systems
Sybase Assesment
ETL : Start using
Informatica PC
ChangeOrganization
DW Assessment Study Data Mining Assessment
Give Trainings :TDWI , Power Designer,Informatica, SQL
Define & givedomain architectrole
Select ETL
Tool:Informatica
Change DWDevelopmentLifecycle
Start Using Exadata
for ODS & Datamarts
Upgrade DW:
Sybase IQ 15.4
Select new DW
Platform
Simplify DWStructure
ILM: Start using
Informatica ILM
TDM: Start using
Informatica TDM
Db Design : Startusing PowerDesigner
DW Re-structurin
Start UsingInformatica MetadataManager
Integrate PowerDesigner/Informatica MM /Deployment tool
Create as-is datamodels in PowerCenter
33- 40
Data Warehouse Simplification
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 34/40
Data Warehouse Simplification
• Data Warehouse Re-Engineering
• Platform Change ( Depends on POC results )
• Model Simplification
• Decrease number of tables
• Database consolidation between ODS + DW + Data Marts
• Near-realtime Data Warehouse
• Re-design ETL jobs with Informatica ( 15,000 jobs )
34- 40
As-is Data Warehouse
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 35/40
Data Warehouse
ProfitabilityDMart FraudDMart D.MiningDMart
Core
Banking
Internet
Banking ATM CreditCard
ODS
Logical
DataWarehouse
Campaign
Management
(Chordiant)
Individual
Banking CRMPrivate Banking
CRM
Corpotate
Commercial
Banking CRM
Opportun
Manageme
Potential
Customer
Management
Reporting
MIS
Profitability
Merchant
Reporting
CreditCard
Cartography
CreditCard
Branch
Reporting
Operations
Dashboard
(Opmis)
Operational Systems
Campaign
DMart
Credit Card
DMarts
PresentationDataMartsBranch
Reporting
CenterCRM DMart
OthersBusiness
Intelligence
CRM
ETL
ETL
OperationalData Store
ETL
35- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 36/40
Data Warehouse Simplification
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 37/40
Data Warehouse Simplification
• Business Intelligence Re-Engineering
• Upgrade and migrate Business Objects from BOXI 3.1 into BO
• Universe consolidation ( More than 200 Universes )
• Report consolidation (12K active reports among 80K reports )
37- 40
Big Data
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 38/40
Big Data
• We are in the learning phase
•Planning to become Big Data Enabl
• Starting establish Hadoop platform
• Planinng to define use cases in ;
• IT of things
• Social CRM
• N-Path analysis for customer churn
• Risk management
38- 40
Social CRM
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 39/40
Social CRM
• Yapi Kredi is one of the most act
social media in the country
• Defining Social CRM strategies
• Working on Customer Matching
• Planning Complex Event Proces
• Planing Location Based Services
39- 40
8/11/2019 Ipaw113 Erdem.ppt
http://slidepdf.com/reader/full/ipaw113-erdemppt 40/40
Ahmet Vefa ErdemDataWarehouse & Data Mining Developm
Yapi Kredi Bank
40- 40