Top Banner
Chapter 1 Introduction to Database Management
30
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: ch01

Chapter 1

Introduction to Database Management

Page 2: ch01

Chapter 1. Introduction to Database Management

Initial Definitions

• Database: a collection of persistent and interrelated data that are shared

by many users and applications

• Data: symbolically represented raw facts (i.e., numeric, textual,

graphical, and often pictorial encoding) about things, people, and events

observed in the real world

• Information: data transformed (e.g., aggregated, sorted, clustered,

sampled, and depicted) for a specific purpose

MIS 6326: Database Management 1

Page 3: ch01

Chapter 1. Introduction to Database Management

Database Characteristics

• Persistent

– data residing on stable storage, not a temporary storage such as computer memory (Textbook’sdefinition)← relevance of intended usage

– Stored data can subsequently be removed from the database only by some explicit request to thedatabase management system, not as a side effect of a program execution. (C.J. Date’s definition)

– Lasts a long time (not transient). (e.x., birth date vs. age)

• Interrelated

– separate data units connected to provide a broader view

– entity: cluster of data about a topic (customer, student, loan)

– relationship: connection among entities

• Shared

– by multiple applications and users (maybe at the same time)

– a database as a common information repository and information exchange platform (file processingvs. database approaches)

MIS 6326: Database Management 2

Page 4: ch01

Chapter 1. Introduction to Database Management

An Example: University Database

University Database

Registration

Grade

Recording

Faculty

Assignment

Course

Scheduling

Entities:

students, faculty, courses,

offerings, enrollments

Relationships:

faculty teach offerings,

students enroll in

offerings, offerings made

of courses, ...

MIS 6326: Database Management 3

Page 5: ch01

Chapter 1. Introduction to Database Management

Database Management System (DBMS)

• Collection of components that support data acquisition,

dissemination, storage, maintenance, retrieval, and

formatting (Database Software)

• Major part of information technology infrastructure

– Enterprise DBMS: supporting mission critical information systems

– Desktop DBMS: for small and end-user databases

– Embedded DBMS: resides in a separate system or device such as

PDA

MIS 6326: Database Management 4

Page 6: ch01

Chapter 1. Introduction to Database Management

Features of DBMS

• Data Definition: Database Creation

• Nonprocedural Access: Data Retrieval

• Application Development: Data Input Screens, Reports,

User-Friendly Environments

• Procedural Language Interface: Serious IS Development

• Transaction Processing: Database Administration

• Database Tuning: Database Administration

MIS 6326: Database Management 5

Page 7: ch01

Chapter 1. Introduction to Database Management

Data Definition

• Define structures of data in a database: Tables and references

• SQL CREATE TABLE statement

Table: a two-dimensional

arrangement of data

SQL: an industry standard

database language

for data definition,

data manipulation, and

database control.

create table Offering (

OfferNo integer not null,

CourseNo char(6) not null,

OffTerm char(6) not null,

OffYear integer not null,

OffLocation varchar(30) null,

OffTime varchar(10) null,

FacSSN char(11) null,

OffDays char(4) null,

constraint OfferingPK primary key (OfferNo),

constraint CourseFK foreign key (CourseNo) references Course,

constraint FacultyFK foreign key (FacSSN) references Faculty )

• Graphical tools

MIS 6326: Database Management 6

Page 8: ch01

Chapter 1. Introduction to Database Management

Data Definition: A Graphical Tool (MS Access)

MIS 6326: Database Management 7

Page 9: ch01

Chapter 1. Introduction to Database Management

A Modeling Language (i.e., Tool) for Data Definition

Student

PK StdNo

StdFirstName

StdLastName

StdCity

StdState

StdZip

StdMajor

StdClass

StdGPA

Offering

PK OfferNo

FK2 CourseNo

FK1 FacNo

OffLocation

OffYear

OffTerm

OffDay

OffTime

Faculty

PK FacNo

FacFirstName

FacLastName

FacCity

FacState

FacZip

FacDept

FacRank

FacSalary

FK1 FacSupervisor

FacHireDate

Enrollment

PK,FK1 StdNo

PK,FK2 OfferNo

EnrGrade

Course

PK CourseNo

CrsDesc

CrsUnits

Registers

Accepts

Teaches

Supervises

has

MIS 6326: Database Management 8

Page 10: ch01

Chapter 1. Introduction to Database Management

Nonprocedural Access: Query

• Query: Request for data to answer a question

• Indicate what parts of database to retrieve, not how to retrieve

them (not the procedural details)

• Improve productivity and improve accessibility

• SQL SELECT statement

select StdFirstName, StdLastName, StdCity, OfferNo, EnrGrade

from enrollment, student

where enrollment.StdSSN = student.StdSSN

and EnrGrade > 3.5

• Graphical tools

MIS 6326: Database Management 9

Page 11: ch01

Chapter 1. Introduction to Database Management

Nonprocedural Access: A Graphical Tool (MS Access)

MIS 6326: Database Management 10

Page 12: ch01

Chapter 1. Introduction to Database Management

Application Development

• Form: formatted document for data entry and display

• Report: formatted document for display

• Use of nonprocedural access to specify data

requirements of forms and reports

– “View”: Standard SQL

– Stored “Query”: Access

MIS 6326: Database Management 11

Page 13: ch01

Chapter 1. Introduction to Database Management

A Sample Data Entry Form (MS Access)

MIS 6326: Database Management 12

Page 14: ch01

Chapter 1. Introduction to Database Management

A Sample Report Form (MS Access)

MIS 6326: Database Management 13

Page 15: ch01

Chapter 1. Introduction to Database Management

Procedural Language Interface

• Combine procedural language with nonprocedural access

• Why

– Batch processing

– Customization and automation

– Performance improvement

• Modes of Connection between a Language and a Database Software

– Direct API calls

– Middleware: ODBC, JDBC

• Examples

– Java or Visual Basic programming over a database

– Web database publishing

MIS 6326: Database Management 14

Page 16: ch01

Chapter 1. Introduction to Database Management

Transaction Processing

• Transaction: a unit of data modification work that

should be reliably processed

• Control simultaneous users

• Recover from failures

Database Tuning

• Analyze and improve database system performance

MIS 6326: Database Management 15

Page 17: ch01

Chapter 1. Introduction to Database Management

Database Technology Evolution

Era Generation Orientation Major Features

1960s 1st File File structures and proprietary

program interfaces

1970s 2nd Network

Navigation

Networks and hierarchies

of related records, standard

program interfaces

1980s 3rd Relational Non-procedural languages,

optimization, transaction

processing

1990s to 2000s 4th Object Multi-media, active,

distributed processing, XML

enabled, data warehouse

processing, cloud computing

MIS 6326: Database Management 16

Page 18: ch01

Chapter 1. Introduction to Database Management

DBMS Marketplace

• Enterprise DBMS

– Oracle: dominates in Unix; strong in Windows

– SQL Server: strong in Windows

– DB2: dominates in mainframe

– Teradata: as a data warehouse platform

– Significant open source systems: MySQL, Firebird, PostgreSQL

• Desktop DBMS

– Access: dominates

MIS 6326: Database Management 17

Page 19: ch01

Chapter 1. Introduction to Database Management

Worldwide Market Share in 2004 & 2005

Source: IDC, May 2006

DB SW Rev. ($M) Share (%) Growth (%)Vendor 2004 2005 2004 2005 2004–2005

Oracle 5,982.4 6,494.7 45.0 44.6 8.6IBM 2,923.0 3,113.0 22.0 21.4 6.5Microsoft 2,013.0 2,441.5 15.1 16.8 21.3Sybase 470.9 502.6 3.5 3.5 6.7NCR/Teradata 390.0 423.0 2.9 2.9 8.5Others 1,528.9 1,590.8 11.5 10.9 4.1

Total 13,308.1 14,464.6 9.4

Source: Gartner Dataquest, May 2006

DB SW Rev. ($M) Share (%) Growth (%)Vendor 2004 2005 2004 2005 2004–2005

Oracle 6,234.1 6,721.1 48.9 48.6 7.8IBM 2,860.4 3,040.7 22.4 22.0 6.3Microsoft 1,777.9 2,073.2 13.9 15.0 16.6NCR/Teradata 412.1 440.7 3.2 3.2 6.9Sybase 382.8 407.0 3.0 2.9 6.3Others 1,090.4 1,134.7 8.5 8.2 4.1

Total 12,757.8 13,817.4 8.3

Open Source DatabaseDevelopment Survey (Source:Evans Data, January 2005)

DB Software Responses

MySQL 53%Firebird 52%PostgreSQL 15%Berkeley DB 4%GNU SQL 3%SAP DB 1%Others 6%

MIS 6326: Database Management 18

Page 20: ch01

Chapter 1. Introduction to Database Management

Worldwide Market Share in 2006

Source: Gartner, June 2007

• Yearly Growth: 14% ($15.2 billion in sales)

• Oracle: 47.1% market share ($7.2 billion in sales), 14.9% revenue growth

• IBM: 21.1% market share ($3.2 billion in sales), 8.8% revenue growth

• Microsoft: 17.4% market share ($2.65 billion in sales), 28% revenue growth

• Teradata: approx. 4% market share ($494 million in sales), 5.7% revenue growth

• Sybase: less than 4% market share ($488 million in sales), less than 5% revenue growth

MIS 6326: Database Management 19

Page 21: ch01

Chapter 1. Introduction to Database Management

Worldwide Market Share in 2007

Source: IDC, June 2008

• Yearly Growth: 12.6% ($18.8 billion in sales in 2007 up from $16.7 billion in 2006)

• Oracle: 44.3% market share ($8.3 billion in sales), 13.3% revenue growth

• IBM: 21.0% market share ($4 billion in sales), 13.3% revenue growth

• Microsoft: 18.5% market share ($3.5 billion in sales), 14% revenue growth

• Sybase: 3.5% market share

• Teradata: 3.3% market share

• Others: 9.4%

MIS 6326: Database Management 20

Page 22: ch01

Chapter 1. Introduction to Database Management

Data Independence

• Definition: A database should have an identity separate from

the applications that use it.

• To alleviate problems with software maintenance

– Software maintenance is a part (50%) of information system budgets.

– To reduce impact of changes by separating database description from

applications

– To change database definition with minimal effect on applications that

use the database

• Three Schema Architecture

– for compartmentalizing database descriptions in order to achieve data

independence.

MIS 6326: Database Management 21

Page 23: ch01

Chapter 1. Introduction to Database Management

Three Schema Architecture

View 1 View 2 View n

ConceptualSchema

InternalSchema

External

Level

Conceptual

Level

Internal

Level

External toConceptualMappings

Conceptualto InternalMappings

MIS 6326: Database Management 22

Page 24: ch01

Chapter 1. Introduction to Database Management

Differences among Levels

• External

– Faculty Assignment Form View: data required for the form in Slide 12

– Faculty Work Load Report View: data required for the report in Slide 13

• Conceptual: the whole database from users’ and DB

developers’ point of view (e.g., Slide 8)

• Internal

– Files needed to store the tables

– Extra files to improve performance

– The whole database from the point of view of hardware and physical

processing

MIS 6326: Database Management 23

Page 25: ch01

Chapter 1. Introduction to Database Management

Client-Server Architecture

Database

Databaseserver

a) Client-server processing with database server

Database

Database

serverMiddleware

server

b) Client-server processing with middleware and database servers

MIS 6326: Database Management 24

Page 26: ch01

Chapter 1. Introduction to Database Management

Parallel Database Architecture

M

N

...

P P P...

M M M

N

...

P P P...

M M

(a) SD (b) SN

Legend

P: processor

M: memory

N: high-speed network

SD: shared disk

SN: shared nothing

MIS 6326: Database Management 25

Page 27: ch01

Chapter 1. Introduction to Database Management

Distributed Database Architecture

Client Server Server

DatabaseDatabase

Client

Client

Client

Denver London

Server

Database

Tokyo

Client

Client

MIS 6326: Database Management 26

Page 28: ch01

Chapter 1. Introduction to Database Management

Cloud Computing

Server Server

DatabaseDatabase

Server

Database

• no initial product licensing costs and no hosting requirements

• web-based interfaces

• dynamic resource allocation

MIS 6326: Database Management 27

Page 29: ch01

Chapter 1. Introduction to Database Management

Database Specialists

• Database administrator (DBA)

– More technical

– DBMS specific skills

• Data administrator

– Less technical

– Planning role

– Information resource management

MIS 6326: Database Management 28

Page 30: ch01

Chapter 1. Introduction to Database Management

DBA Responsibilities

Technical Non-technical

Designing conceptual schemas Setting database standards

Designing internal schemas Devising training materials

Monitoring database performance Promoting benefits of

databasesSelecting and evaluating database

software

Consulting with users

Managing security for database usage Planning new databases

Troubleshooting database problems

MIS 6326: Database Management 29