Top Banner
1 Data & Databases Technology Technology Guide 3 Guide 3
27
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Data

1

Data & Databases

Technology Technology Guide 3Guide 3

Page 2: Data

2

File Management

BitByteFieldRecordFile

Database

Entity

Attribute

Key field

Key file management concepts include:

Page 3: Data

3

Hierarchy of Data

Page 4: Data

4

Accessing Records from Computer Files

In sequential file organization:

Data records must be retrieved in the same physical sequence in which they are stored.

In direct or random file organization: Users can access records in any sequence, without regard to actual physical order on the storage medium.

Page 5: Data

5

Problems Arising in the File Environment

Data redundancy: The same piece of information could be duplicated in several files.

Data inconsistency: The actual values across various copies of the data no longer agree.

Data isolation. Data files are likely to be organized differently, stored in different formats, and often physically inaccessible to other applications.

Security: is difficult to enforce in the file environment.

Page 6: Data

6

Problems Arising in the File Environment

Data Integrity: It is difficult to place data integrity constraints across multiple data files.

Application/ Data independence: In the file environment, the applications and their associated data files are dependent on each other.

The numerous problems arising from the file environment approach led to the development of databases. Database: an organized logical grouping of related files.

Page 7: Data

7

Database Management Systems

The program (or group of programs) that provides access to a database is known as a database management system (DBMS).

There are many specialized databases, depending on the type or format of data stored. A geographical information database contains locational data for

overlaying on maps or images.

A knowledge database stores decision rules used to evaluate situations and help users make decisions like an expert.

A multimedia database stores data on many media—sounds, video, images, graphic animation, and text.

Page 8: Data

8

Database Management Systems (cont.)

Three major components of a DBMS:

Data definition language

Data manipulation language Data dictionary

Page 9: Data

9

Data Definition Language (DDL)

DDL is the language used by programmers to specify the content and structure of the database.

A DBMS user defines views or schemes using the DDL.

A schema - the logical description of the entire database and the listing of all the data items and the relationships among them.

A subschema - the specific set of data from the database that is required by each application.

Page 10: Data

10

Data Manipulation Language (DML)

DML is used with a third- or fourth-generation language to manipulate the data in the database.

DML provides users with the ability to retrieve, sort, display, and delete the contents of a database.

Requesting information from a database is the most commonly performed operation. Structured query language (SQL) Query-by-example (QBE)

Page 11: Data

11

Data Dictionary

Data Dictionary is a file that stores definitions of data elements and data characteristics such as usage, physical representation, ownership, authorization, and security.

A data element represents a field.

Page 12: Data

12

Logical Data Organization

There are three basic models for logically structuring databases:

Hierarchical

Network

Relational

Three additional models are emerging:

Multidimensional

Object-oriented

Hypermedia

Page 13: Data

13

The Hierarchical Model

The hierarchical model relates data by rigidly structuring data into an inverted “tree” in which records contain two elements:

1. A single root or master field, often called a key, which identifies the type location, or ordering of the records.

2. A variable number of subordinate fields that defines the rest of the data within a record.

The hierarchical structure is commonly found in many traditional business organizations and processes.

Page 14: Data

14

The Networked-based Model

The network model creates relationships among data through a linked-list structure in which subordinated records (members) can be linked to more than one owner.

Explicit links, called pointers, are used to link subordinates and owners. That relationship is called a set.

Many-to-many relationships are possible with a network database model—a significant advantage of the network model over the hierarchical model.

Page 15: Data

15

The Relational Database Model

The relational model is based on a simple concept of tables in order to capitalize on characteristics of rows and columns of data, which is consistent with real-world business situations.

Tables are called relations, and the model is based on the mathematical theory of sets and relations.

A row is called a tuple, and a column is called an attribute.

One of the greatest advantages of the relational model is its conceptual simplicity and the ability to link records in a way that is not predefined.

Page 16: Data

16

Creating Databases

To create a database, designers must develop both a conceptual and physical design: Conceptual design - an abstract model of the database from the

user or business perspective. – Describes how the data elements in the database are to be grouped.

Physical design shows how the database is actually arranged on direct access storage devices.

Groups of data are organized, refined, and streamlined until an overall logical view of the relationships among all of the data elements in the database appears.

Page 17: Data

17

Database Structures

Page 18: Data

18

Entity Relationship Diagram

Database designers often document the conceptual data model with an entity-relationship (ER) diagram.

An entity is something that can be identified in the users’ work environment.

An instance of an entity is the representation of a particular entity. Entities have attributes, or properties, that describe the entity’s

characteristics. Entity instances have identifiers, which are attributes that identify

entity instances. Entities are associated with one another in relationships, which

can include many entities.

Page 19: Data

19

Normalization of Relational Databases

The process of creating small, stable data structures from complex groups of data is called normalization.

Specifically, normalization has several goals: Eliminate redundancy.

Avoid update anomalies (i.e., errors from inserting, deleting, and modifying records).

Represent accurately the item being modeled.

Simplify maintenance and information retrieval.

Page 20: Data

20

Emerging Database Models

The most common database models are:

Multimedia database Deductive databases Object-oriented databases Multimedia and hypermedia databases

Page 21: Data

21

Object-Oriented Database Model

Object-oriented (OO) databases store both data and procedures acting on the data, as objects.

The OO database can be particularly helpful in multimedia environments, such as in manufacturing sites using CAD/CAM.

OO databases can be particularly useful in supporting temporal

and spatial dimensions.

Terminology in the OO model includes: objects, attributes, classes, methods, and messages.

Page 22: Data

22

Hypermedia Database Model

The hypermedia database model stores chunks of information in the form of nodes connected by links established by the user.

The nodes can contain text, graphics, sound, full-motion video, or executable computer programs.

Users can branch to related information in any kind of relationship.

Page 23: Data

23

Data Warehouses

A data warehouse is an additional database that is designed to support DSS, EIS, online analytical processing (OLAP), and other end-user activities, such as report generation, queries, and graphical presentation.

A data mart is smaller, less expensive, and more focused than a large-scale data warehouse. Data marts can be a

substitution for a data warehouse, or they can be used in addition to it.

Page 24: Data

24

Database Typology

A centralized database has all the related files in one physical location.

A distributed database has complete copies of a database, or portions of a database, in more than one location, which is usually close to the user.

A replicated database has complete copies of the entire database in several locations.

A partitioned database is subdivided, so that each location has a portion of the entire database.

Page 25: Data

25

Physical vs. Logical Data View

How can a single, unified database meet the differing requirements of so many users?

A DBMS minimizes these problems by providing two “views” of the database data: The physical view deals with the actual, physical arrangement and

location of data in the direct access storage devices (DASD).

The logical view, or user’s view, represents data in a format that is meaningful to a user and to the software programs that process that data.

Page 26: Data

26

Database Management

Database management outside of purely technical hardware and software considerations, consists primarily of two functions:

Database design and implementation– Specialists should carefully consider the individual needs of all

existing and potential users.

Database administration– Database administrators are IT specialists responsible for

ensuring that the database fulfills the user’s business needs.

Page 27: Data

27

IP Storage Storage can be connected to

servers over IP (Internet protocol) networks, also known as IP storage.

This enables servers to connect to SCSI (small computer system interface) storage devices as if they were directly attached to the server, regardless of the location.