Top Banner
Distributed DBMS Outline Introduction What is a distributed DBMS Problems Current state-of-affairs Background Distributed DBMS Architecture Distributed Database Design Semantic Data Control Distributed Query Processing Distributed Transaction Management Parallel Database Systems Distributed Object DBMS Database Interoperability Current Issues
29

Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Aug 24, 2018

Download

Documents

phungnhu
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Outline❏ Introduction

➠ What is a distributed DBMS➠ Problems➠ Current state-of-affairs

❏ Background❏ Distributed DBMS Architecture❏ Distributed Database Design❏ Semantic Data Control❏ Distributed Query Processing❏ Distributed Transaction Management❏ Parallel Database Systems❏ Distributed Object DBMS❏ Database Interoperability ❏ Current Issues

Page 2: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

File Systems

program 1data description 1

program 2data description 2

program 3data description 3

File 1

File 2

File 3

Page 3: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Database Management

database

DBMS

Applicationprogram 1(with datasemantics)

Applicationprogram 2(with datasemantics)

Applicationprogram 3(with datasemantics)

descriptionmanipulation

control

Page 4: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Motivation

DatabaseTechnology

ComputerNetworks

integration distribution

integration

integration ≠ centralization

DistributedDatabaseSystems

Page 5: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Distributed Computing

� A concept in search of a definition and a name.

� A number of autonomous processing elements (not necessarily homogeneous) that are interconnected by a computer network and that cooperate in performing their assigned tasks.

Page 6: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

� Synonymous terms➠ distributed function➠ distributed data processing➠ multiprocessors/multicomputers➠ satellite processing➠ backend processing➠ dedicated/special purpose computers➠ timeshared systems➠ functionally modular systems

Distributed Computing

Page 7: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

� Processing logic

� Functions

� Data

� Control

What is distributed …

Page 8: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

What is a Distributed Database System?

A distributed database (DDB) is a collection of multiple, logically interrelated databases distributed over a computer network.

A distributed database management system (D–DBMS) is the software that manages the DDB and provides an access mechanism that makes this distribution transparent to the users.

Distributed database system (DDBS) = DDB + D–DBMS

Page 9: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

� A timesharing computer system� A loosely or tightly coupled multiprocessor

system� A database system which resides at one of the

nodes of a network of computers - this is a centralized database on a network node

What is not a DDBS?

Page 10: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Centralized DBMS on a Network

CommunicationNetwork

Site 5

Site 1Site 2

Site 3Site 4

Page 11: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Distributed DBMS Environment

CommunicationNetwork

Site 5

Site 1Site 2

Site 3Site 4

Page 12: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Implicit Assumptions� Data stored at a number of sites each site

logically consists of a single processor.� Processors at different sites are interconnected

by a computer network no multiprocessors➠ parallel database systems

� Distributed database is a database, not a collection of files data logically related as exhibited in the users’ access patterns

➠ relational data model � D-DBMS is a full-fledged DBMS

➠ not remote file system, not a TP system

Page 13: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Shared-Memory Architecture

Examples : symmetric multiprocessors (Sequent, Encore) and some mainframes (IBM3090, Bull's DPS8)

P1 Pn MD

Page 14: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Shared-Disk Architecture

Examples : DEC's VAXcluster, IBM's IMS/VS Data Sharing

DP1

M1

Pn

Mn

Page 15: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Shared-Nothing Architecture

Examples : Teradata's DBC, Tandem, Intel's Paragon, NCR's 3600 and 3700

P1

M1

D1

Pn

Mn

Dn

Page 16: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

� Manufacturing - especially multi-plant manufacturing

� Military command and control� EFT� Corporate MIS� Airlines� Hotel chains� Any organization which has a

decentralized organization structure

Applications

Page 17: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Distributed DBMS Promises

❶ Transparent management of distributed, fragmented, and replicated data

❷ Improved reliability/availability through distributed transactions

❸ Improved performance

❹ Easier and more economical system expansion

Page 18: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Transparency� Transparency is the separation of the higher level

semantics of a system from the lower level implementation issues.

� Fundamental issue is to providedata independence

in the distributed environment

➠ Network (distribution) transparency

➠ Replication transparency

➠ Fragmentation transparency� horizontal fragmentation: selection� vertical fragmentation: projection� hybrid

Page 19: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Example

TITLE SAL

PAY

Elect. Eng. 40000Syst. Anal. 34000Mech. Eng. 27000Programmer 24000

PROJ

PNO PNAME BUDGET

ENO ENAME TITLEE1 J. Doe Elect. Eng.E2 M. Smith Syst. Anal.E3 A. Lee Mech. Eng.E4 J. Miller ProgrammerE5 B. Casey Syst. Anal.E6 L. Chu Elect. Eng.E7 R. Davis Mech. Eng.E8 J. Jones Syst. Anal.

EMPENO PNO RESP

E1 P1 Manager 12

DUR

E2 P1 Analyst 24E2 P2 Analyst 6E3 P3 Consultant 10E3 P4 Engineer 48E4 P2 Programmer 18E5 P2 Manager 24E6 P4 Manager 48E7 P3 Engineer 36

E8 P3 Manager 40

ASG

P1 Instrumentation 150000

P3 CAD/CAM 250000P2 Database Develop. 135000

P4 Maintenance 310000

E7 P5 Engineer 23

Page 20: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Transparent Access

SELECT ENAME,SAL

FROM EMP,ASG,PAY

WHERE DUR > 12

AND EMP.ENO = ASG.ENO

AND PAY.TITLE = EMP.TITLE Paris projectsParis employeesParis assignmentsBoston employees

Montreal projectsParis projectsNew York projects

with budget > 200000Montreal employeesMontreal assignments

Boston

CommunicationNetwork

Montreal

Paris

NewYork

Boston projectsBoston employeesBoston assignments

Boston projectsNew York employeesNew York projectsNew York assignments

Tokyo

Page 21: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Distributed Database - User View

Distributed Database

Page 22: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Distributed DBMS - Reality

CommunicationSubsystem

UserQuery

DBMSSoftware

DBMSSoftware User

Application

DBMSSoftware

UserApplicationUser

QueryDBMS

Software

UserQuery

DBMSSoftware

Page 23: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Potentially Improved Performance

� Proximity of data to its points of use➠ Requires some support for fragmentation and replication

� Parallelism in execution➠ Inter-query parallelism

➠ Intra-query parallelism

Page 24: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Parallelism Requirements

� ❇Have as much of the data required by eachapplication at the site where the application executes

➠ Full replication

� How about updates?➠ Updates to replicated data requires implementation of

distributed concurrency control and commit protocols

Page 25: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

System Expansion

� Issue is database scaling

� Emergence of microprocessor and workstation technologies

➠ Demise of Grosh's law

➠ Client-server model of computing

� Data communication cost vs telecommunication cost

Page 26: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Distributed DBMS Issues

� Distributed Database Design➠ how to distribute the database➠ replicated & non-replicated database distribution➠ a related problem in directory management

� ❇Query Processing➠ convert user transactions to data manipulation

instructions➠ optimization problem➠ min{cost = data transmission + local processing}➠ general formulation is NP-hard

Page 27: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

Distributed DBMS Issues

� ❇Concurrency Control➠ synchronization of concurrent accesses➠ consistency and isolation of transactions' effects➠ deadlock management

� Reliability➠ how to make the system resilient to failures➠ atomicity and durability

Page 28: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

DirectoryManagement

Relationship Between Issues

Reliability

DeadlockManagement

QueryProcessing

ConcurrencyControl

DistributionDesign

Page 29: Outline - University of Winnipegion.uwinnipeg.ca/~ychen2/distributeDB/Introduction.pdf · Outline Introduction ... (with data semantics) description manipulation control. Di s t r

Distributed DBMS

� Operating System Support➠ operating system with proper support for database

operations➠ dichotomy between general purpose processing

requirements and database processing requirements� Open Systems and Interoperability

➠ Distributed Multidatabase Systems➠ More probable scenario➠ Parallel issues

Related Issues