Top Banner
Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others can understand, the third understands neither for itself nor through others. This first kind is excellent, the second good, and the third kind useless. Machiavelli, The Prince, 1513.
54

Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Dec 23, 2015

Download

Documents

Diane Davis
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Organizational intelligence technologies

There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others can understand, the third understands

neither for itself nor through others. This first kind is excellent, the second good, and the third kind useless.

Machiavelli, The Prince, 1513.

Page 2: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Organizational intelligence

Organizational intelligence is the outcome of an organization’s efforts to collect store, process, and interpret data from internal and external sourcesIntelligence in the sense of gathering and distributing information

Page 3: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Types of information systems

Type of information system

System’s purpose

Transaction processing system

TPS

Collects and stores data from routine transactions

Management information system

MIS

Converts data from a TPS into information for planning, controlling, and managing an organization

Decision support system

DSS

Supports managerial decision making by providing models for processing and analyzing data

Business Intelligence

BI

Enables the business to develop a better understanding of its key stakeholders and organizational environment

On-line analytical processing

OLAP

Presents a multidimensional, logical view of data to the analyst with no requirements as to how the data are stored

Data mining Uses statistical analysis and artificial intelligence techniques to identify hidden relationships in data

Page 4: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

The information systems cycle

Page 5: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Transaction processing systems

Can generate huge volumes of dataA telephone company may generate several hundred million records per dayRaw material for organizational intelligence

Page 6: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

The problem

Organizational memory is fragmented

Different systemsDifferent database technologiesDifferent locations

An underused intelligence system containing undetected key facts about customers

Page 7: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

The data warehouse

A repository of organizational dataCan be measured in petabytes (1015)

Page 8: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Managing the data warehouse

ExtractionTransformationCleaningLoadingSchedulingMetadata

Page 9: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Extraction

Pulling data from existing systemsOperational systems were not designed for extraction to load into a data warehouseApplications are often independent entitiesTime consuming and complexAn ongoing process

Page 10: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Transformation

Encodingm/f, male/female to M/F

Unit of measureinches to cms

Fieldsales-date to salesdate

Datedd/mm/yy to yyyy/mm/dd

Page 11: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Cleaning

Same record stored in different departmentsMultiple records for a companyMultiple entries for the same organizationMisuse of data entry fields

Page 12: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Scheduling

A trade-offToo frequent is costlyInfrequently means old data

Page 13: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Metadata

A data dictionary containing additional facts about the data in the warehouse

Description of each data typeFormat Coding standardsMeaningOperational system sourceTransformationsFrequency of extracts

Page 14: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Warehouse architectures

CentralizedFederatedTiered

Page 15: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Centralized data warehouse

Page 16: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Federated data warehouse

Page 17: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Tiered data warehouse

Page 18: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

The hardware/software decision

The default is rapidly becomingHadoop for file managementMapReduce for programmingCommodity nodes for processing

Page 19: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Exploiting data stores

Verification and discoveryData miningOLAP

Page 20: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Verification and discovery

Verification Discovery

What is the average sale for in-store and catalog customers?

What is the best predictor of sales?

What is the average high school GPA of students who graduate from college compared to those who do not?

What are the best predictors of college graduation?

Page 21: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

OLAP

Relational model was not designed for data synthesis, analysis, and consolidationThis is the role of spreadsheets and other special purpose softwareNeed to complement RDBMS technology with a multidimensional view of data

Page 22: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

TPS versus OLAP

TPS OLAP

Optimize for transaction volume

Optimize for data analysis

Process a few records at a time

Process summarized data

Real time update as transactions occur

Batch update (e.g., daily)

Based on tables Based on hypercubes

Raw data Aggregated data

SQL is widely used MDX becoming a standard

Page 23: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

ROLAP

A relational OLAPA multidimensional model is imposed on a relational structureRelational is a mature technology with extensive data management featuresNot as efficient as OLAP

Page 24: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

The star structure

A central fact table is connected to multiple dimensional tables

A single join can relate the fact table with any one of the dimensional tables

Page 25: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

The snowflake structure

An extension of the star schema to handle very large dimensional tables

Multiple joins might be required to fetch data.

Page 26: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Rotation

Page 27: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Drill down

Region Sales variance

Africa 105%

Asia 57%

Europe 122%

North America 97%

Pacific 85%

South America 163%

Nation Sales variance

China 123%

Japan 52%

India 87%

Singapore 95%

Page 28: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

A hypercube

Page 29: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

A three-dimensional hypercube display

Page Columns

Region: North

Sales

Red blob

Blue blob

Total

1996

Rows 1997

Year Total

Page 30: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

A six-dimensional hypercube

Dimension Example

Brand Mt. Airy

Store Atlanta

Customer segment

Business

Product group Desks

Period January

Variable Units sold

Page 31: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

A six-dimensional hypercube display

Page Columns

MonthSegment

Product groupVariable

March Business Desks Chairs

Units Revenue Units Revenue

Carolina Atlanta

Boston

Rows Mt. Airy Atlanta

Brand Boston

Store Totals

Page 32: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

The link between RDBMS and MDDB

Page 33: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

MDDB design

Key conceptsVariable dimensions• What is tracked

• Sales

Identifier dimensions• Tagging what is tracked

• Time, product, and store of sale

Page 34: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Prompts for identifying dimensions

Prompt ExampleWhen? June 5, 2013

10:27amWhere? ParisWhat? TentHow? CatalogWho? Young adult womanWhy? Camping trip to

BoliviaOutcome?

Revenue of €624.00

Transaction data

Transaction data

Face recognition or credit card co.

Social media

Page 35: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Variables and identifiers

Identifier time (hour)

Variablesales

(dollars)

10:00 523

11:00 789

12:00 1,256

13:00 4,128

14:00 2,634

Identifier

hit

Variabletime (hh:mm:ss)

1 9:34:45

2 9:34:57

3 9:36:12

4 9:41:56

Page 36: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Exercise

An international hotel chain has asked you to design a multidimensional database for its marketing department. What identifier and variable dimensions would you select?

Page 37: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Analysis and variable type

Identifier dimension

Continuous Nominal or ordinal

Variable dimension

Continuous

Regression and curve fittingSales by quarter

Analysis of varianceSales by store

Nominal or ordinal

Logistic regression Customer response (yes or no) to the level of advertising

Contingency table analysisNumber of sales by region

Page 38: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Multidimensional expressions (MDX)

A language for reporting data stored in a multidimensional databaseSQL like SELECT {[measures].[unit sales] }

ON COLUMNS FROM [sales]MeasuresUnit sales

266,773

Page 39: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Pentaho

Open source Business Intelligence projectBuilds on Mondrian, Jpivot, and other open source BI productsHome page

Page 40: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Data mining

The search for relationships and patternsApplications

Database marketingPredicting bad loansDetecting flaws in VLSI chipsIdentifying quasars

Page 41: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Data mining functions

Associations85 percent of customers who buy a certain brand of wine also buy a certain type of pasta

Sequential patterns32 percent of female customers who order a red jacket within six months buy a gray skirt

ClassifyingFrequent customers as those with incomes about $50,000 and having two or more children

ClusteringMarket segmentation

PredictingPredict the revenue value of a new customer based on that person’s demographic variables

Page 42: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Data mining technologies

Decision treesGenetic algorithmsK-nearest-neighbor methodNeural networksData visualization

Page 43: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

SQL-99 and OLAP

SQL can be tedious and inefficientThe following questions require four queries

Find the total revenueReport revenue by locationReport revenue by channel Report revenue by location and channel

Page 44: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

SQL-99 extensions

GROUP BY extended withGROUPING SETSROLLUPCUBE

MySQL supports only ROLLUP and in a slightly different format

Page 45: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

GROUPING SETSSELECT location, channel, SUM(revenue)FROM expedGROUP BY GROUPING SETS (location, channel);

Page 46: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

GROUPING SETS

Location Channel Revenue

null Catalog 108762

null Store 347537

null Web 27166

London null 214334

New York null 39123

Paris null 143303

Sydney null 29989

Tokyo null 56716

Page 47: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

ROLLUP

SELECT location, channel, SUM(revenue)FROM expedGROUP BY ROLLUP (location, channel);

Page 48: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

ROLLUPLocation Channel Revenue

null null 483465London null 214334New York null 39123Paris null 143303Sydney null 29989Tokyo null 56716London Catalog 50310London Store 151015London Web 13009New York Catalog 8712New York Store 28060New York Web 2351Paris Catalog 32166Paris Store 104083Paris Web 7054Sydney Catalog 5471Sydney Store 21769Sydney Web 2749Tokyo Catalog 12103Tokyo Store 42610Tokyo Web 2003

Page 49: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

CUBE

SELECT location, channel, SUM(revenue)FROM expedGROUP BY CUBE (location, channel);

Page 50: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Location Channel Revenuenull Catalog 108762null Store 347537null Web 27166null null 483465London null 214334New York null 39123Paris null 143303Sydney null 29989Tokyo null 56716London Catalog 50310London Store 151015London Web 13009New York Catalog 8712New York Store 28060New York Web 2351Paris Catalog 32166Paris Store 104083Paris Web 7054Sydney Catalog 5471Sydney Store 21769Sydney Web 2749

Tokyo Catalog 12103

Tokyo Store 42610

Tokyo Web 2003

CUBE

Page 51: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

MySQL version of ROLLUPSELECT location, FORMAT(SUM(revenue),0)FROM expedGROUP BY location WITH ROLLUP;

SELECT location, channel, FORMAT(SUM(revenue),0)

FROM expedGROUP BY location, channel WITH ROLLUP;

Page 52: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Exercises

Using ClassicModelsCompute total payments by country without and with ROLLUPCompute total payments by country and year without and with ROLLUPCompute total value of orders by country, and product line without and with ROLLUP

Page 53: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

SQL OLAP extensions

UsefulNot as powerful as MDDB tools

Page 54: Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others.

Conclusion

Data management is an evolving disciplineData managers have a dual responsibility

Manage data to be in business todayManage data to be in business tomorrow

Data managers now need to support organizational intelligence technologies