F4109 – DATABASE SYSTEM UNIT 1 DATABASE SYSTEM
Nov 18, 2014
F4109 – DATABASE SYSTEM
UNIT 1
DATABASE SYSTEM
INTRODUCTION TO DATABASE
PART 1
prepared by: [email protected]
File-Based Systems
• Collection of application programs that perform services for the end users.
• Each program defines and manages its own data.
prepared by: [email protected]
File-Based Processing
prepared by: [email protected]
Limitations of File-Based Approach
• Separation and isolation of data– Each program maintains its own set of data.– Users of one program may be unaware of potentially
useful data held by other programs.
• Duplication of data– Same data is held by different programs.– Wasted space and potentially different values and/or
different formats for the same item.
prepared by: [email protected]
Limitations of File-Based Approach
• Data dependence– File structure is defined in the program code.
• Incompatible file formats– Programs are written in different languages, and so cannot
easily access each other’s files.
• Fixed Queries/Proliferation of application programs– Programs are written to satisfy particular functions.– Any new requirement needs a new program.
prepared by: [email protected]
Database Approach
• Arose because:– Definition of data was embedded in application
programs, rather than being stored separately and independently.
– No control over access and manipulation of data beyond that imposed by application programs.
• Result: – the database and Database Management System (DBMS).
prepared by: [email protected]
Database
• Shared collection of logically related data (and a description of this data), designed to meet the information needs of an organization.
• System catalogue (metadata) provides description of data to enable program–data independence.
• Logically related data comprises entities, attributes, and relationships of an organization’s information.
prepared by: [email protected]
Database Management System (DBMS)
• A software system that enables users to define, create, and maintain the database and that provides controlled access to this database.
prepared by: [email protected]
Database Management System (DBMS)
prepared by: [email protected]
Database Approach
• Data definition language (DDL).– Permits specification of data types, structures and any
data constraints. – All specifications are stored in the database.
• Data manipulation language (DML).– General enquiry facility (query language) of the data.
prepared by: [email protected]
Database Approach
• Controlled access to database may include:– A security system.– An integrity system.– A concurrency control system.– A recovery control system.– A user-accessible catalogue.
• A view mechanism.– Provides users with only the data they want or need to
use.
prepared by: [email protected]
Components of DBMS Environment
prepared by: [email protected]
•Can range from a PC to a network of computers.Hardware•DBMS, operating system, network software (if necessary) and also the application programs.Software•Used by the organization and a description of this data called the schema.Data•Instructions and rules that should be applied to the design and use of the database and DBMS.Procedures•Human who used or deal with the database systemPeople
Components of DBMS Environment
prepared by: [email protected]
Roles in the Database Environment
• Data Administrator (DA)• Database Administrator (DBA)• Database Designers (Logical and Physical)• Application Programmers• End Users (naive and sophisticated)
prepared by: [email protected]
History of Database Systems
FIRST GENERATION
•Hierarchical•Network
SECOND GENERATION
•Relational
THIRD GENARATION
• Obj Relational• Obj Oriented
prepared by: [email protected]
Hierarchical Database Model
• Logically represented by an upside down tree– Each parent can have many children– Each child has only one parent
prepared by: [email protected]
Hierarchical Database Model
ADVANTAGES
•Conceptual simplicity•Database security and integrity•Data independence•Efficiency
DISADVANTAGES
•Complex implementation•Difficult to manage and lack of standards•Lacks structural independence•Applications programming and use complexity•Implementation limitations (no M:N relationship)
prepared by: [email protected]
Network Database Model
Network schema (database organization)
Subschema (views of database per user)
Data management language
Identified 3 database component
prepared by: [email protected]
Network Database Model
• Each record can have multiple parents– Composed of sets - relationships– Each set has owner record and member record– Member may have several owners– A set represents a 1:M relationship between the owner
and the member
prepared by: [email protected]
Network Database Model
ADVANTAGE
S
Conceptual simplicityHandles more relationship typesData access flexibilityPromotes database integrityData independenceConformance to standards
DISADVANTA
GES
System complexityLack of structural independence
prepared by: [email protected]
Relational Database Model
• Perceived by user as a collection of tables for data storage
• Tables are a series of row/column intersections (a row corresponds to a record, a column to a field)
• Tables related by sharing common entity characteristic(s)
• RDBMS
prepared by: [email protected]
Relational Database Model
prepared by: [email protected]
Relational Database Model
ADVANTAGES
Structural independence
Improved conceptual simplicity
Easier database design, implementation, management, and use
Ad hoc query capability with SQL
Powerful database management system
DISADVANTAGES
Substantial hardware and system software overhead
Poor design and implementation is made easy
May promote “islands of information” problems
prepared by: [email protected]
ADVANTAGES OF DBMSs
• Control of data redundancy• Data consistency• More information from the same amount of
data• Sharing of data• Improved data integrity• Improved security• Enforcement of standards• Economy of scale
prepared by: [email protected]
ADVANTAGES OF DBMSs
• Balanced conflicting requirements• Improved data accessibility and
responsiveness• Increased productivity• Improved maintenance through data
independence• Increased concurrency• Improved backup and recovery services
prepared by: [email protected]
DISADVANTAGES OF DBMSs
• Complexity• Size• Cost of DBMS• Additional hardware costs• Cost of conversion• Performance• Higher impact of a failure
DATABASE ENVIRONMENT
PART 2
prepared by: [email protected]
Objectives of Three-Level Architecture
All users should be able to access
same data.
A user’s view is immune to changes made in other views
Users should not need to know
physical database storage details
DBA should be able to change database storage
structures without affecting the users’ views
Internal structure of database should be
unaffected by changes to physical aspects of
storage
DBA should be able to change conceptual
structure of database without affecting all users
1 2 3
4 5 6
prepared by: [email protected]
ANSI-SPARC Three-Level Architecture
prepared by: [email protected]
ANSI-SPARC Three-Level Architecture
EXTERNAL• Users’ view of the
database• Describes that
part of database that is relevant to a particular user
CONCEPTUAL• Community view
of the database • Describes what
data is stored in database and relationships among the data
INTERNAL• Physical
representation of the database on the computer
• Describes how the data is stored in the database
prepared by: [email protected]
Differences between Three Levels of ANSI-SPARC Architecture
prepared by: [email protected]
Data Independence
LOGICAL
Refers to immunity of external schemas to changes in conceptual schemaConceptual schema changes (e.g. addition/removal of entities)Should not require changes to external schema or rewrites of application programs
PHYSICAL
Refers to immunity of conceptual schema to changes in the internal schema.Internal schema changes (e.g. using different file organizations, storage structures/devices).Should not require change to conceptual or external schemas.
prepared by: [email protected]
Data Independence and the ANSI-SPARC Three-Level Architecture
prepared by: [email protected]
Database Languages
DDL
• Description language
• Define or modify the schema not manipulate data
• associated integrity and security constraints
DML
• Provides basic data manipulation operations on data held in the database
PRO-DML
• allows user to tell system exactly how to manipulate data
NONPRO-DML
• allows user to state what data is needed rather than how it is to be retrieved
4 GL
• Query Languages
• Forms Generators
• Report Generators
• Graphics Generators
• Application Generators