StatistiX An Introduction 11. Oracle Business Analytics Big Data und Data Warehouse Konferenz 2016 15. und 16. März 2016

May 02, 2018


StatistiX

An Introduction

11. Oracle Business Analytics Big Data und Data Warehouse Konferenz 2016 15. und 16. März 2016

What do we do?

Deutsche Börse Group

[Diagram: StatistiX (Development and Operations) between its source systems and its consumers.]

Source systems named on the slide: Eurex Classic / PND, EEX, Xetra / PND, Eurex T7, Prisma, RE / EDRE / PND, WSS, Calypso Sec Lending, CEF Cash, CEF Eurex, Eurex C7, Eurex Repo.

Inbound interfaces: File Interface; Direct Real-Time Connection and / or File Interface.

Outputs: External Reports, Frontend Reporting, Internal Reports, Billing Input & Reporting, Regulatory Reporting, ODBC Access, Ad-hoc Analysis, Data Vendors (Real-Time / File Interface), EMail.

Consumers named on the slide: Prisma & Risk Analytics; MMPM, HypMi, PERD II; Regulatory Billing; RICA & OTC IRS Tools; Business Support; EEX BDWH.


The Challenge

IT and Business

Options contract

A contract that, in exchange for the option price, gives the option buyer the right, but not the obligation, to buy (or sell) a financial asset at the exercise price from (or to) the option seller within a specified time period, or on a specified date (expiration date).

What ???

int i = 1;

i += i++ + ++i;

What's the value of i?

What ???

… ROI … Time To Market … TCO …

!!! JUST DO IT !!!

Dr. Murray Bozinsky Gordon Gekko
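The code snippet on this slide only has a defined answer in some languages. As a minimal sketch, assuming the snippet is read as Java (the deck does not say which language it is in), the class below evaluates the statement and then repeats the evaluation step by step. The class and method names are illustrative only. In C or C++ the same statement modifies i more than once without sequencing, so its behaviour is undefined there.

```java
public class IncrementPuzzle {
    // The statement from the slide, evaluated under Java's left-to-right rules.
    public static int slideStatement() {
        int i = 1;
        i += i++ + ++i;
        return i;
    }

    // The same evaluation written out one step at a time.
    public static int stepByStep() {
        int i = 1;
        int saved = i;   // compound assignment saves the left operand first: 1
        int first = i;   // i++ yields the old value 1 ...
        i = i + 1;       // ... and then i becomes 2
        i = i + 1;       // ++i increments first, so i becomes 3 ...
        int second = i;  // ... and yields 3
        i = saved + (first + second); // 1 + (1 + 3)
        return i;
    }

    public static void main(String[] args) {
        System.out.println(slideStatement()); // prints 5
        System.out.println(stepByStep());     // prints 5
    }
}
```

Under these Java rules the answer is 5, which fits the slide's point that such a question is obvious to a developer and opaque to a business user.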

The Common Answer

[Diagram: three roles arranged along an axis from business skills to technical skills.]

Business << Analysis, Usage >>

Functionality << Analysis, Specification, Test >>

IT << Development, Operations >>

Artifacts named on the slide: Functional Requirements, Technical Requirements, Software Systems.

The StatistiX Philosophy

[Diagram: the three roles Business << Analysis, Usage >>, Functionality << Analysis, Specification, Test >> and IT << Development, Operations >>, with Functional Requirements, Technical Requirements and Software Systems, along an axis from business skills to technical skills.]

Shifting development towards business

Qin Dynasty (221 BC - 207 BC)

The Qin Dynasty was the first Chinese united empire

• Unification by conquest of different kingdoms

• Key success factor (simplified): Standardisation

• Technology – industrially produced, easy-to-use weapons

• People – function-based, equally trained soldiers

Transform: Many Kingdoms → One United Empire

Success Factor Standardisation: Technology

Simple construction

Industrially manufactured

Efficient division of labour

Reproducible quality

Easy to produce

Effective

Two days of training

Success Factor Standardisation: People

Transform: Farmer → Crossbowman

Need for a huge number of soldiers trained for different functions

Translation to IT

Enable business people to develop IT systems

The Mission

Be the data warehouse of Deutsche Börse AG.

Be our long-term memory.

Be the storage and analytics/reporting facility for operational systems.

Be a consumer of continuously increasing data.

Be a reliable source system for operational systems.

Keep your old principles: high quality, flexible, dynamic, innovative, no costs.

That's Life!

A New Warehouse Is Born! StatistiX Rising

Intention: Replace user-individual Excel analytics and calculations based on unconsolidated data with a centralised, audit-proof reporting environment based on a commonly agreed dataset.

Tools: Oracle Database, Informatica for ETL, MicroStrategy for reporting

Most important inbound data:

Trading data from XETRA and EUREX, tick data from CEF, master data from WSS

Most important outbound data:

Typical standard reporting (turnaround, ranking, …), index data

Size: Small.

[Diagram: initial StatistiX architecture in two datacenters. Datacenter I: Development/Simulation. Datacenter II: Production. Components shown in each datacenter: DBAG source systems, inbound fileserver, Informatica, Oracle DB StatistiX, MicroStrategy, MSI Web for DBAG users, outbound fileserver, World. Also shown: DBAG CEF and Oracle DB CEF, with one Oracle DB CEF instance marked as fallback.]

StatistiX Being A Toddler: What Am I Doing On This Planet?

Growing, trying out capabilities, acquiring new skills - evolving!

Focus: Integrate more sources and gather more data.

Improve StatistiX infrastructure.

Improve data model.

Provide more and new services to DBAG users and DBAG customers.

Tools: Oracle Database, Informatica, MicroStrategy

New: ControlM for orchestration, Webmethods for GUI and workflow applications

Most important inbound data:

Further entities from the trading systems XETRA and EUREX, tick data from CEF, master data from WSS, data from outside DBAG (e.g. Reuters)

Most important outbound data:

Enhanced reporting (turnaround, ranking, …), index data, content for the DBAG public website, customer-individual and personalised standard information products

Size: Small.

[Diagram: toddler-phase architecture in two datacenters. Datacenter I: Development/Simulation. Datacenter II: Production. Components shown in each datacenter: DBAG source systems, inbound fileserver, Informatica, Oracle DB StatistiX, MicroStrategy, MSI Web for DBAG users, outbound fileserver, World, plus the new components ControlM, Webmethods and DBAG user apps. Also shown: DBAG CEF and Oracle DB CEF, with one Oracle DB CEF instance marked as fallback.]

StatistiX As A Child: Growing And Learning

Intention: Integrate more sources and more data into StatistiX.

Stabilise the StatistiX infrastructure and partially refresh it.

Further improve the data model.

Enhance the set of services for DBAG users and DBAG customers.

Tools: Oracle Database, Informatica, MicroStrategy, ControlM, Webmethods

Most important inbound data:

Further entities from the trading systems XETRA and EUREX, tick data from CEF, master data from WSS, data from outside DBAG (e.g. Reuters)

Most important outbound data:

Further enlargement of the information product portfolio (MMPM), new index information products (products for other exchanges), content for the DBAG public website, customer-individual and personalised standard information products

Size: Midsize

[Diagram: child-phase architecture, with the same components as in the toddler phase. Datacenter I: Development/Simulation. Datacenter II: Production. Components shown in each datacenter: DBAG source systems, inbound fileserver, Informatica, Oracle DB StatistiX, MicroStrategy, MSI Web for DBAG users, outbound fileserver, ControlM, Webmethods, DBAG user apps, World. Also shown: DBAG CEF and Oracle DB CEF, with one Oracle DB CEF instance marked as fallback.]

Basics On StatistiX Infrastructure

Almost all StatistiX tools run on dedicated hardware.

CPU-based software licenses count the sockets/cores of the underlying hardware, not the number assigned to the virtual machine (ControlM, MicroStrategy, Oracle).

Tool requirements are extremely high, so the tools need a huge amount of resources. Either the virtualisation software cannot provide this, or it is not reasonable to put a VM on top of hardware that will be used almost completely by one application (e.g. Informatica).

Some smaller tools, and the applications based on them, run in virtual machines.

In the past, large applications were put on SUN/Solaris infrastructure; only Microsoft software (MS Analysis Services) ran on Windows.

Various types of SUN/Solaris machines were in use (V240 – M4000).

The former database server used internal disks as database storage.

In the child phase of StatistiX, the first techrefresh took place, mainly to replace old hardware.

The basic structures of the old infrastructure were kept!

Upgrade from Oracle 8 to Oracle 9!

StatistiX Teenage Years: Grow, try different things, sometimes fail and get up again.

Focus: New challenging projects:

Clearstream Infosmart (very high data volume and previously unknown requirements for delivery reliability), billing project, and others

Creating own tools to provide functionality missing in commercial software products (e.g., TFP = TargetFileProcessing – later on replaced by ControlM AFT).

Improve services and reliability.

Tools: Oracle Database, Informatica, MicroStrategy, ControlM, Webmethods

Most important inbound data:

Further entities from the trading systems XETRA and EUREX, tick data from CEF, master data from WSS, data from outside DBAG (e.g. Reuters), data from the Clearstream operational system Dataserver

Most important outbound data:

Enhanced reporting (turnaround, ranking, …), index data, content for the DBAG public website, customer-individual and personalised standard information products, Clearstream operational reporting suite

Size: Large.

[Diagram: teenage-phase architecture, with the same components as in the earlier phases. Datacenter I: Development/Simulation. Datacenter II: Production. Components shown in each datacenter: DBAG source systems, inbound fileserver, Informatica, Oracle DB StatistiX, MicroStrategy, MSI Web for DBAG users, outbound fileserver, ControlM, Webmethods, DBAG user apps, World. Also shown: DBAG CEF and Oracle DB CEF, with one Oracle DB CEF instance marked as fallback.]

StatistiX Teenage Years: Grow, try different things, sometimes fail and get up again.

Challenge:

Continuously increasing data volume and workload are bringing the infrastructure to its limits.

Improve reliability while the shortage of resources is obvious.

Implementation of disaster recovery and hardware failover is still incomplete.

Implementation of huge projects in parallel with several smaller ones, in addition to operating and maintaining the warehouse, using the same staff for all of these tasks.

Doing major or minor upgrades of the software in use.

Consequence:

Next techrefresh ahead!

Next Techrefresh To The End Of Teenage Time

Change from SUN/Solaris systems to x86 hardware and Red Hat Enterprise Linux.

Partially upgrading tool software (either because of the new OS or just using the opportunity for a cleanup and a neat restart).

Focus on increasing resources (make things larger) on the tool infrastructure.

Protection against hardware failure or loss of a datacenter based on existing hardware and manual processes.

Database server with Direct Attached Storage (HP P2000). Upgrade to 11g R2. Using ASM for storage management.

Protection of the database with Oracle Data Guard and a server normally used for the simulation database.

StatistiX Teenage Years: Database TechRefresh (Still In Use)

[Diagram: Oracle DB StatistiX across two datacenters. Datacenter I: Development/Simulation. Datacenter II: Production. Databases shown: simulation database (Oracle DB SIM), production database primary (Oracle DB PRD), production database standby (Oracle DB PRD standby), connected by Data Guard. Storage on each side: eight P2000 arrays, divided into a fast disk area and a slow disk area.]

StatistiX Getting Operational. Do more, at once, at the same time, in real time and at no cost.

Focus: Change StatistiX from a data warehouse into an operational system with real-time/near-real-time processing and integration into the production loop.

Major projects: store and use data from the new EUREX trading system T7, store and report data from the risk system PRISMA, receive and use data from the new EUREX clearing system C7

Tools: Oracle Database, Informatica, MicroStrategy, ControlM, Webmethods

New: AMQP (bi-directional), Java programming, T7 client, C7 client

Most important inbound data:

T7, C7, PRISMA

Most important outbound data:

Reporting and analytics on T7, C7 and PRISMA data

Size: Large to very large.

StatistiX Getting Operational. Do more, at once, at the same time, in real time and at no cost.

Main requirements:

Downtime due to loss of service (hardware or datacenter failure) < 2 h.

Separation of simulation and production on the hardware level, not only logically.

Production possible in different datacenters.

No data loss on physical or logical database failure.

StatistiX Getting Operational. Overview StatistiX Environments And Basic Components As Of Today.

Environments: STX DEV, PRISMA DEV, STX/PRISMA ACC, STX/PRISMA SIM, STX PRD, PRISMA PRD

StatistiX components in each environment: WH Databases, StatistiX File Server, Control M, Inbound File Server, Outbound File Server, Repository Database, Informatica, MatLab, Mathematica, Java Engines, OS Scripts, MicroStrategy, Webmethods

Non-StatistiX components: Source Systems (inbound), Target Systems (outbound), World

StatistiX Getting Operational. Overview StatistiX Environment Production As Of Today

[Diagram: production environment across Datacenter Hausen and Datacenter Bergen-Enkheim, surrounded by non-StatistiX components and the world. The assignment of hosts to datacenters is not recoverable from the transcript. Labels on the slide:]

Applications (hot): dwhpas4/6

Applications (hot): dwhpas7/8

ControlM Server (hot), datacenter DWH_PROD: dwhpbill1

ControlM Server (cold), datacenter DWH_PROD: dwhsbill1

Webmethods (hot): Dwhpmet1/3

MicroStrategy IS (cold): frpstat1

MicroStrategy IS (hot): frpstat2

MicroStrategy Web (cold): frpstat3

MicroStrategy Web (hot): frpstat4

MicroStrategy NC (hot): frpstat7

MicroStrategy NC (cold): frpstat8

MatLab (hot): frpires03

Veritas Cluster: NFS hot + Informatica Options FB (dwhpnfs); Informatica Options hot + NFS FB (dwhpasrt)

Oracle DB Primary, dtb, Storage Array P2000: Dwhpdb3

Oracle DB Phys. Standby, prisma dtb, Storage Array P2000: Dwhsdb3

Oracle DB Primary, prisma, Storage Array P2000: Dwhpprisdb1

Oracle DB Primary, nta, Storage Array P2000: Ndwhpdb2

Oracle DB dwhprep: Morepha5/6

Connections: Data Guard (Prisma, DTB/NTA), Orchestration + file transport, File transport

StatistiX Getting Operational. Overview StatistiX Environment Simulation As Of Today

[Diagram: simulation environment across Datacenter Hausen and Datacenter Bergen-Enkheim, surrounded by non-StatistiX components and the world. The assignment of hosts to datacenters is not recoverable from the transcript. Labels on the slide:]

Applications (hot): dwhsas4/6

Applications (hot): dwhsas5/7

ControlM Server (hot), datacenter DWH_SIMU: dwhpbill1

ControlM Server (cold), datacenter DWH_SIMU: dwhsbill1

Webmethods (hot): vdwhsmet1

MicroStrategy IS/Web/NC (hot): frsstat1

Veritas Cluster: NFS hot + Informatica Options FB (dwhsnfs); Informatica Options hot + NFS FB (dwhsasrt)

Oracle DB Phys. Standby, dtbsimu, Hitachi HUSVM storage: dwhsdb4/5

Oracle DB dwhsrep: Morepha5/6

Connections: Orchestration + file transport, File transport

StatistiX Getting Operational. Overview StatistiX Environments And Basic Components To Be Implemented

[Diagram: Oracle DB StatistiX across two datacenters. Datacenter I: Development/Simulation. Datacenter II: Production. Databases shown: simulation database (Oracle DB SIM), production database primary (Oracle DB PRD), production database standby (Oracle DB PRD standby), connected by Data Guard, each with a fast disk area and a slow disk area. Storage on each side: Hitachi HUSVM SAN storage (1PT) with areas labelled Data, Backup I and Backup II.]

StatistiX Getting Operational. Do more, at once, at the same time, in real time and at no cost.

Architectural issues with database(s):

Several databases for special purposes (T7, PRISMA).

The special databases are small or midsize, but require the same effort as the main database.

Projects very often need data from their own database plus additional data from the main database (database links -> poor SQL).

Non-optimal usage of resources (free storage in one place while it is needed elsewhere; unused capacity because of usage during daytime or nighttime only).

The current infrastructure does not match the needs of usage (hot-warm-cold data, rapid growth compared with the past, sandboxing, …) and scalability.

High initial costs and maintenance.

Life without appliance is possible, yet pointless?

Freely adapted from Loriot’s famous quote ‘Life without a pug is possible, yet pointless!’

Where will we go from here?

Stay with what you know!

Something else?

Suicide?

Get a hipster?

Life without appliance is possible, yet pointless?

Freely adapted from Loriot’s famous quote ‘Life without a pug is possible, yet pointless!’

Yes you can!

Not an easy way.

Effort rises permanently.

Highly complex environment.

More and more a specialist job.

Small and midsize environment.

If it makes life easier, you need not!

Reduce complexity.

Establish a consolidated, homogeneous environment.

Integration instead of separation.

Clearly defined responsibility.

Keep up to date with new technology and its advantages.

Life without appliance is possible, yet pointless? Future of StatistiX?

Exadata for a database of limited size.

BDA for off-loading old, staging-only, or mass data.

A single access path to data using SQL and database security.

Next steps: use BDA in a native way for a controlled batch environment.

Future: Get business specialists

Thank You!

Axel Goldbach, Deutsche Börse Group

Manfred Wollmann, in-factory GmbH

Backup

Content

The Challenge

The StatistiX Philosophy

The Pillars of StatistiX

Reaching the Goal

Abstraction and Visual Tools

StatistiX Architecture

The Pillars of StatistiX

Choose the right People

Let the Database be the Centre of your Universe

Use Problem Domain specific Tools

Choose the right People

That's what it's all about:

The right mix!

Let the Database be the Centre of your Universe

Working with Relational Databases is Mathematics:

A table column is a set

⇒ A table is a cross product of sets

⇒ The content of a table is a set of tuples with an order defined on it

⇒ The query language SQL is set theory

∧ All people working for StatistiX have an academic background including mathematics

q.e.d.

Use Problem Domain specific Tools (1)

An Excerpt

Goal: Focus on “what to do”, not on “how to do”
Criterion: Highest possible abstraction
Example: Don’t think about how to bring master data together with prices, just do it

Goal: Use data, don't think about how and where it is stored
Criterion: Multiple source/target types in one processing step
Example: Tables from Oracle, DB2, Sybase, Greenplum etc. together with data structures like CSV, XML etc.

Goal: Configure technical constraints, don't develop them
Criterion: It's there, just configure it
Example: Buffering and caching for processes, and their parallelisation

Goal: Make business processes visible and controllable
Criterion: Real-time visual views on batch and manual processes
Example: Batch monitoring, BPM

Goal: Cost transparency
Criterion: Extensive database-based logging
Example: Calculate a service’s costs from KPIs like CPU time, processed rows, used storage etc.

Goal: Know dependencies between systems
Criterion: Database-based repositories
Example: Analyze the chain from orchestration down to which job writes to which table
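The cost transparency goal above can be illustrated with a small calculation. This is a minimal sketch, assuming the logged KPIs are CPU seconds, processed rows and used storage; the class name, the method and all unit prices are hypothetical and not taken from StatistiX.

```java
public class ServiceCost {
    // Hypothetical unit prices; real rates would come from the cost model in use.
    static final double EUR_PER_CPU_SECOND = 0.01;
    static final double EUR_PER_MILLION_ROWS = 1.0;
    static final double EUR_PER_GB_STORED = 0.1;

    // Adds up the cost of one service from three logged KPIs.
    public static double cost(double cpuSeconds, long rowsProcessed, double storageGb) {
        return cpuSeconds * EUR_PER_CPU_SECOND
                + (rowsProcessed / 1_000_000.0) * EUR_PER_MILLION_ROWS
                + storageGb * EUR_PER_GB_STORED;
    }

    public static void main(String[] args) {
        // 1000 CPU seconds, 2 million rows and 50 GB come to roughly 17 EUR
        // under the assumed rates above.
        System.out.println(cost(1000, 2_000_000L, 50));
    }
}
```

Because the KPIs are already logged per job in the database, a calculation like this could run as a query or report rather than as separate code.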

Use Problem Domain specific Tools (2)

An Excerpt

Problem domain: High-performance data loading of huge volumes
Tool: Informatica

Problem domain: Implementation, execution, and monitoring of processes running jobs on different servers with different operating systems
Tool: Control-M

Problem domain: Business intelligence and reporting defined on the business level (scheduled or ad hoc)
Tool: MicroStrategy

Problem domain: Complex calculations with modern mathematical libraries
Tool: Mathematica / MATLAB

Problem domain: Implementation, execution, and monitoring of business processes
Tool: webMethods BPM

Problem domain: Integration of various systems
Tool: Informatica, webMethods IntegrationServer

Use Problem Domain specific Tools (3)

[Diagram: Activities at StatistiX, divided into Third Party Tools and In-house Development (Oracle, Java, Shell Script, Network etc.), with the shares ~98% and ~2%. Further labels: Development, Technology, Business, Abstraction.]

Reaching the Goal

Enable business people to develop systems - Abstraction of technology to functionality

Avoid bugs - Configure frameworks instead of developing code

People are eye-minded (75%) - Use visual development tools

People get bored by routine - Avoid repeating work

People should not focus on solving technical challenges - Use all-in-one solutions for specific problem domains

Advantages of Abstraction and Problem Domain Specific Tools

Rapidly rising learning curve

Fewer bugs

Systems are self-descriptive

People are more motivated

People become productive earlier

Page 51: StatistiX An Introduction - Oracle DWH · StatistiX An Introduction ... or on a specified date (expiration date). What ??? ... Oracle Database, Informatica, MicroStrategy, ControlM,

Advantages of extensive Logging

Measurement (per job):

• Where did it run?
• When did it run?
• How long did it run?
• How many of which resources did it consume? (hard disk space, CPU usage, number of rows processed)

Purpose:

• Peaks in server usage
• Determine bottlenecks
• Fulfil SLAs

Measurement areas:

• Development
• Resource planning
• Billing
• Operations
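
The questions on this slide map onto one log record per job run. The following is a minimal Python sketch of that idea, not the StatistiX logging schema: the `JobRun` record, its field names, the helpers `peak_hours` and `sla_violations`, and the sample values are all assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class JobRun:
    """One log record per job execution (hypothetical field names)."""
    job: str
    host: str              # where did it run?
    started: datetime      # when did it run?
    ended: datetime
    cpu_seconds: float     # which resources did it consume?
    disk_mb: float
    rows_processed: int

    @property
    def duration(self) -> timedelta:
        # how long did it run?
        return self.ended - self.started


def peak_hours(runs: list[JobRun]) -> Counter:
    """Count job starts per (host, hour) to spot peaks in server usage."""
    return Counter((r.host, r.started.hour) for r in runs)


def sla_violations(runs: list[JobRun], limit: timedelta) -> list[JobRun]:
    """Return the runs that took longer than the agreed limit."""
    return [r for r in runs if r.duration > limit]


# Invented sample records, for illustration only.
runs = [
    JobRun("job_a", "srv1", datetime(2016, 3, 15, 2, 0),
           datetime(2016, 3, 15, 2, 20), 310.0, 120.0, 1_000_000),
    JobRun("job_b", "srv1", datetime(2016, 3, 15, 2, 30),
           datetime(2016, 3, 15, 4, 0), 4200.0, 800.0, 50_000_000),
]
slow = sla_violations(runs, timedelta(hours=1))
```

With records of this shape, the same log could serve development (bottlenecks), resource planning (peaks), billing (resources consumed) and operations (SLA checks).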


Abstraction and Visual Tools


Visual Tools


webMethods

Control-M

Informatica


What is ETL?

[Diagram: Extract → Transform → Load. Abstract sources (feed, database, file) are connected to abstract targets (feed, database, file) by a mapping; the mapping is the area of development.]


Focus on “what to do” not on “how to do” (1/3)

Sum up the traded volume per day and instrument

• Get all prices
• Get listing master data
• Get instrument master data
• Get ISIN for listing
• Bring ISIN together with the price
• Aggregate price per day and instrument
• Output: aggregated volume

Sources are abstract – could be a database table, a CSV or XML file etc.
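
The steps on this slide can be sketched in plain Python to show that the logic does not depend on where the rows come from. This is only an illustration and not how an Informatica mapping is implemented; the column names (`listing_id`, `instrument_id`, `isin`, `day`, `volume`), the placeholder ISIN and the sample rows are assumptions.

```python
from collections import defaultdict
from typing import Iterable, Mapping

Row = Mapping[str, object]


def traded_volume_per_day_and_instrument(
    prices: Iterable[Row],
    listings: Iterable[Row],
    instruments: Iterable[Row],
) -> dict:
    """Sum traded volume per (day, ISIN); each source is any iterable of rows."""
    # Get ISIN for listing: listing -> instrument -> ISIN
    isin_by_instrument = {i["instrument_id"]: i["isin"] for i in instruments}
    isin_by_listing = {
        l["listing_id"]: isin_by_instrument[l["instrument_id"]] for l in listings
    }
    # Bring ISIN together with the price, then aggregate per day and instrument
    totals = defaultdict(int)
    for p in prices:
        isin = isin_by_listing[p["listing_id"]]
        totals[(p["day"], isin)] += p["volume"]
    return dict(totals)


# Invented sample rows; they could equally come from a table, a CSV or an XML file.
instruments = [{"instrument_id": 1, "isin": "XX0000000001"}]
listings = [
    {"listing_id": 10, "instrument_id": 1},
    {"listing_id": 11, "instrument_id": 1},
]
prices = [
    {"listing_id": 10, "day": "2016-03-15", "volume": 100},
    {"listing_id": 11, "day": "2016-03-15", "volume": 50},
    {"listing_id": 10, "day": "2016-03-16", "volume": 70},
]
result = traded_volume_per_day_and_instrument(prices, listings, instruments)
```

Because the function only iterates over rows, swapping a database cursor for `csv.DictReader` output would not change the aggregation logic.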


Focus on “what to do” not on “how to do” (2/3)

Sum up the traded volume per day and instrument


How to bring listing master data together with the prices


Focus on “what to do” not on “how to do” (3/3)

Sum up the traded volume per day and instrument


Aggregate by

How to aggregate


High Performance Data Loading

Informatica enables us to process up to 550,000 records per second in the Eurex MMPM calculation.

• Measured in a calculation step (so it is not only “simple” loading)

• Informatica offers all capabilities of parallel processing through configuration only

• No technical knowledge of multi-threading, pipes, shared memory etc. is necessary
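
To illustrate what parallel processing "with configuration only" can mean, here is a generic Python sketch using the standard library. It is not Informatica's mechanism; `PARTITIONS`, `transform` and `run_partitioned` are hypothetical names. The transformation stays untouched and only the partition count changes.

```python
from concurrent.futures import ThreadPoolExecutor

PARTITIONS = 4  # the only setting that changes the degree of parallelism


def transform(record: int) -> int:
    """Stand-in for a calculation step applied to each record."""
    return record * 2


def run_partitioned(records: list, partitions: int) -> list:
    """Split records into round-robin partitions and process them concurrently."""
    chunks = [records[i::partitions] for i in range(partitions)]
    with ThreadPoolExecutor(max_workers=partitions) as pool:
        # One task per partition; results come back in partition order.
        results = list(pool.map(lambda chunk: [transform(r) for r in chunk], chunks))
    return [r for chunk in results for r in chunk]


out = run_partitioned(list(range(10)), PARTITIONS)
```

Threads keep the example short; for CPU-bound work in CPython, a process pool would be the usual choice. The output order follows the partitions, not the input order.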


Visual Modelling of Executable Business Processes

4-Eyes-Principle

[Screenshot of the process model with callouts: process name, trigger, condition, failed step, successfully completed step, step not yet started, central error handling.]


Visual Programming

Freefloat Calculation for DAX Indices


Projects

Service

Service implementation

Service call

Signature of the service to call


Orchestration of Batch Job Nets

[Screenshot: job net overview. Job states shown: job successfully ended, job currently running, job waiting for execution, job failed.]
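
A job net is essentially a dependency graph: a job may start once all of its predecessors have ended successfully. The following minimal sketch uses Python's standard `graphlib` module to show the idea. It is not Control-M's job definition format, the job names are hypothetical, and jobs run one after another here rather than in parallel.

```python
from graphlib import TopologicalSorter

# job -> set of jobs that must end successfully first (hypothetical names)
job_net = {
    "load_prices": set(),
    "load_master_data": set(),
    "aggregate_volume": {"load_prices", "load_master_data"},
    "send_report": {"aggregate_volume"},
}


def run_job_net(net, run_job):
    """Run jobs in dependency order and return the final status of each job."""
    status = {job: "waiting" for job in net}
    sorter = TopologicalSorter(net)
    sorter.prepare()
    while True:
        ready = sorter.get_ready()
        if not ready:
            break  # nothing left to start: all done, or blocked by a failure
        for job in ready:
            status[job] = "running"
            if run_job(job):
                status[job] = "ended ok"
                sorter.done(job)  # releases the successors
            else:
                status[job] = "failed"  # successors stay "waiting"
    return status


all_ok = run_job_net(job_net, lambda job: True)
one_failed = run_job_net(job_net, lambda job: job != "load_prices")
```

In the second call the failed job blocks everything downstream, which is what the "waiting" state in a job net overview would show.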


Mathematical Computation (1/2)

“When was the last time you spent a pleasant evening in a comfortable chair, reading a good program?”

Jon Louis Bentley, Mathematician and Computer Scientist

Communications of the ACM, 1986


Mathematical Computation (2/2)

Isn't that more understandable than the lines above?

Documentation and “code” in the same place tell a story rather than giving a technical description of what to do.

Same formula after symbolic simplification


MicroStrategy BI Platform


StatistiX Architecture


StatistiX Architecture – The Outer View

[Diagram: StatistiX in the centre, with inbound interfaces on one side and outbound interfaces on the other. Interface types shown: GUI, File, Feed, Email, ESB, Database, Application.]


Interfaces

Type    Tool
GUI     HTML, webMethods CAF, MicroStrategy Web Interface
Feed    CEF, OFI, ESB Prisma
Mail    Lotus Notes, SMTP
File    Upload via GUI, Download via GUI, ftp, Filesystem

[The Inbound and Outbound columns of this table held graphical marks that are not recoverable from the transcript.]


StatistiX Architecture – The Inner View

[Diagram: inner view of StatistiX; the only recoverable labels are two Database components.]


Tools and Layers

[Diagram: tools arranged by layer and feature strength. Layers: Orchestration; Business Abstraction; Technical Abstraction, Native Access; Database; Other Applications. Tools: Control-M, Informatica, MicroStrategy, webMethods, Mathematica, Oracle, StatistiX OLTP Engine, StatistiX Realtime Interface.]