Top Banner
Chapter 3 Databases and Data Warehouses Building Business Intelligence
45
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Chapter 3 Databases and Data Warehouses

Chapter 3

Databases and

Data Warehouses

Building Business Intelligence

Page 2: Chapter 3 Databases and Data Warehouses

Overview

• Business Intelligence

• The Relational Database Model

• Database Management System Tools

• Data Warehouses and Data Mining

• Managing The Information Resource In An Organization

Page 3: Chapter 3 Databases and Data Warehouses

Opening Case StudyHigh Tech Battles High School Truancy

• Organizations need databases (and data warehouses) for organizing and managing information.

• Why are the implementation of security and privacy measures difficult?

Page 4: Chapter 3 Databases and Data Warehouses

Introduction

• Databases and data warehouses are methods for organizing and managing information and business intelligence.

• Database management systems and data mining tools are IT tools you use to work with information and business intelligence.

Page 5: Chapter 3 Databases and Data Warehouses

• Business intelligence - is knowledge about your:

– Customers

– Competitors

– Partners

– Competitive environment

– Internal operations

Business Intelligence

Page 6: Chapter 3 Databases and Data Warehouses

Business Intelligence

Page 7: Chapter 3 Databases and Data Warehouses

Business Intelligence

• Two types of information processing include:

1. Online transaction processing (OLTP) - the gathering of input information, processing that information, and updating existing information to reflect the gathered and processed information.

• Operational databases - databases that support OLTP.

2. Online analytical processing (OLAP) - the manipulation of information to support decision making.

Page 8: Chapter 3 Databases and Data Warehouses

Business Intelligence

• A data warehouse is a special form of a database that contains information gathered from many operational databases for the purpose of supporting decision-making tasks.

Page 9: Chapter 3 Databases and Data Warehouses

The Relational Database Model

• Database - a collection of information that you organize and access according to the logical structure of that information.

• Relational database model - uses a series of logically related two-dimensional tables or files to store information in the form of a database.

• Relation - describes each two-dimensional table or file in the relational model.

Page 10: Chapter 3 Databases and Data Warehouses

The Relational Database Model

• Relational databases are composed of two parts:

1. Information – stored in a series of two-dimensional tables, files, or relations.

2. Logical structure of the information.

Page 11: Chapter 3 Databases and Data Warehouses

The Relational Database Model Collections of Information

Page 12: Chapter 3 Databases and Data Warehouses

The Relational Database Model Created with Logical Structures

• When you create a database, you first create the data dictionary.

• Data dictionary - contains the logical structure for the information.

Page 13: Chapter 3 Databases and Data Warehouses

The Relational Database Model Created with Logical Structures

Part NumberPart Number is the is the primary key because primary key because of the key icon beside of the key icon beside it.it.

For For Percentage MarkupPercentage Markup, , we defined its we defined its FormatFormat as “Percent” and its as “Percent” and its number of decimal number of decimal places as 2.places as 2.

Page 14: Chapter 3 Databases and Data Warehouses

The Relational Database Model With Logical Ties Among the Information

• Primary key - a field (or group of fields in some cases) that uniquely describes each record.

• Foreign key - a primary key of one file that appears in another file.

Page 15: Chapter 3 Databases and Data Warehouses

The Relational Database Model With Logical Ties Among the Information

Page 16: Chapter 3 Databases and Data Warehouses

The Relational Database Model With Built-In Integrity Constraints

• Integrity constraints – rules that help ensure the quality of the information.

Team Work

Primary Keys, Foreign Keys, and Integrity Constraints

(p. 133)(p. 133)

Page 17: Chapter 3 Databases and Data Warehouses

Database Management System Tools

• Database management system (DBMS) – helps you specify the logical organization for a database and access and use the information within a database.

• A DBMS contains the following five important software components:– DBMS engine– Data definition subsystem– Data manipulation subsystem– Application generation subsystem– Data administration subsystem

Page 18: Chapter 3 Databases and Data Warehouses

Database Management System Tools

Page 19: Chapter 3 Databases and Data Warehouses

• DBMS engine - accepts logical requests from the various other DBMS subsystems, converts them into their physical equivalent, and actually accesses the database and data dictionary as they exist on a storage device.

• Physical view - deals with how information is physically arranged, stored, and accessed on some type of storage device such as a hard disk.

• Logical view - focuses on how you arrange and access information to meet your particular business needs.

Database Management System Tools

Page 20: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Definition Subsystem

• Data definition subsystem - helps you create and maintain the data dictionary and define the structure of the files in a database.

(For a great overview on the logical properties of information review the table on page 136)

Page 21: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Manipulation Subsystem

• Data manipulation subsystem - helps you add, change, and delete information in a database and mine it for valuable information. Tools here include views, report generators, QBE, and SQL.

• View - allows you to see the contents of a database file, make whatever changes you want, perform simple sorting, and query to find the location of specific information.

Page 22: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Manipulation Subsystem

Click here to enter a Click here to enter a new record.new record.

Find information using Find information using the binoculars.the binoculars.

Page 23: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Manipulation Subsystem

• Report generator - helps you quickly define formats of reports and what information you want to see in a report.

Page 24: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Manipulation Subsystem

• By following a series of simple screens, you can easily create the report below.

Page 25: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Manipulation Subsystem

• Query-by-example (QBE) tools - help you graphically design the answer to a question.

Page 26: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Manipulation Subsystem

The QBE gridThe QBE grid

Our selection criteriaOur selection criteria

Page 27: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Manipulation Subsystem

• Structured query language (SQL) - a standardized fourth-generation query language found in most DBMSs.

• The SQL below creates the same report in Figure 3.7 on page 139.– SELECT Part.[Part Number], Part.Cost, Employee.[Employee

Name], Employee.[Employee Number] FROM Part, EmployeeWHERE (((Part.Cost)>10));

Page 28: Chapter 3 Databases and Data Warehouses

DBMS ToolsApplication Generation Subsystem

• Application generation subsystem - contains facilities to help you develop transaction-intensive applications. – Data entry screens– DBMS-specific programming languages– Commonly used programming languages

Page 29: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Administration Subsystem

• Data administration subsystem - a DBMS helps you manage the overall database environment by providing facilities for backup and recovery, security management, query optimization, concurrency control, and change management.

On Your Own

DBMS Support OLTP, OLAP, and

Information Management (p. 142)(p. 142)

Page 30: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Administration Subsystem

• Backup and recovery facilities:– Periodically back up information contained in a

database. – Restart or recover a database and its information in

case of a failure.

• Security management facilities - control who has access to what information and what type of access those people have.

Page 31: Chapter 3 Databases and Data Warehouses

DBMS ToolsData Administration Subsystem

• Query optimization facilities - take queries from users and restructure them to minimize response times.

• Reorganization facilities - continually maintain statistics concerning how the DBMS engine physically accesses information.

• Concurrency control facilities - ensure the validity of database updates when multiple users attempt to access and change the same information.

Page 32: Chapter 3 Databases and Data Warehouses

Data Warehouses and Data Mining

What Is a Data Warehouse?

• Data warehouse - a logical collection of information – gathered from many different operational databases – used to create business intelligence that supports business analysis activities and decision-making tasks.

Page 33: Chapter 3 Databases and Data Warehouses

Data Warehouses and Data Mining

What Is a Data Warehouse?

Page 34: Chapter 3 Databases and Data Warehouses

Data Warehouses and Data Mining

What Is a Data Warehouse?

• Data warehouses are not transaction-oriented.

• Data warehouses support online analytical processing (OLAP).

Page 35: Chapter 3 Databases and Data Warehouses

Data Warehouses and Data Mining

What Are Data Mining Tools?

• Data mining tools - software tools you use to query information in a data warehouse. These tools include:– Query-and-reporting tools - similar to QBE tools,

SQL, and report generators in the typical database environment.

– Intelligent agents – use various artificial intelligence tools to form the basis of information discovery and building business intelligence in OLAP.

Page 36: Chapter 3 Databases and Data Warehouses

Data Warehouses and Data Mining

What Are Data Mining Tools?

• Data mining tools continued– Multidimensional analysis (MDA) tools -

slice-and-dice techniques that allow you to view multidimensional information from different perspectives.

– Statistical tools – help you apply various mathematical models to the information stored in a data warehouse to discover new information.

Page 37: Chapter 3 Databases and Data Warehouses

Data Warehouses and Data Mining

What Are Data Mining Tools?

Page 38: Chapter 3 Databases and Data Warehouses

Data Warehouses and Data Mining

Data Marts – Smaller Data Warehouses

• Data mart - a subset of a data warehouse in which only a focused portion of the data warehouse information is kept.

Page 39: Chapter 3 Databases and Data Warehouses

Data Warehouses and Data Mining

Important Considerations• Do you need a data warehouse?

• Do all your employees need an entire data warehouse?

• How up-to-date must the information be?

• What data mining tools do you need?

Team Work

How Up-to-Date Should Data Warehouse

Information Be? (p. 149)(p. 149)

Page 40: Chapter 3 Databases and Data Warehouses

MANAGING THE INFORMATION RESOURCE Who Should Oversee the Organization’s

Information?• Chief information officer (CIO) - responsible for

overseeing an organization’s information resource.

• Data administration - plans for, oversees the development of, and monitors the information resource.

• Database administration - responsible for the more technical and operational aspects of managing the information contained in organizational databases.

Page 41: Chapter 3 Databases and Data Warehouses

MANAGING THE INFORMATION RESOURCE How Will Changes in Technology Affect Organizing and Managing Information?

• As new technologies become available, you should ask yourself whether those technologies will help you organize and manage your information better.

• One of the greatest technological changes that will occur over the coming years is a convergence of different tools that will help you better organize and manage information.

Page 42: Chapter 3 Databases and Data Warehouses

MANAGING THE INFORMATION RESOURCE

Is Information Ownership a Consideration?

• Information ownership is a key consideration in today’s information-based business environment.

• Ownership refers to who is responsible for information quality.

On Your Own

CRUD – Defining Information Ownership

(p. 151)(p. 151)

Page 43: Chapter 3 Databases and Data Warehouses

MANAGING THE INFORMATION RESOURCE What Are the Ethics Involved in Managing

and Organizing Information?

• Databases, data warehouses, DBMSs, and data mining tools make it possible for people to easily access all kinds of organizational information.

• How does an organization safeguard against the unethical use of information within the organization?

Page 44: Chapter 3 Databases and Data Warehouses

Closing Case Study OneWe’ve Got OLTP Covered; Let’s Go on to

OLAP

• What is the single most important factor that hinders all organizations in general from providing good online analytical processing (OLAP) support?

• Why is it so much easier for organizations to provide good online transaction processing (OLTP) support?

Page 45: Chapter 3 Databases and Data Warehouses

Closing Case Study TwoMining Dining Data

• Consider the issue of timely information with respect to the businesses discussed in the case.

• Which of the businesses must have the most up-to-date information in its data warehouse?