IMS 6217: Distributed Databases 1 Dr. Lawrence West, Management Dept., University of Central Florida [email protected]Distributed Databases (DDB) • Introduction to DDB • Three DDB Exercises – General Electric – J.C. Penney Stores – UPS • Integrating Autonomous Systems • Replication • Replication Exercise • Implementing Replication
27
Embed
IMS 6217: Distributed Databases 1 Dr. Lawrence West, Management Dept., University of Central Florida [email protected] Distributed Databases (DDB) Introduction.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
IMS 6217: Distributed Databases
1Dr. Lawrence West, Management Dept., University of Central [email protected]
Distributed Databases (DDB)
• Introduction to DDB
• Three DDB Exercises
– General Electric
– J.C. Penney Stores
– UPS
• Integrating Autonomous Systems
• Replication
• Replication Exercise
• Implementing Replication
IMS 6217: Distributed Databases
2Dr. Lawrence West, Management Dept., University of Central [email protected]
Distributed Databases—Introduction
• What is a Distributed Database (DDB)?
• How many ways can a database be distributed?(And for how many reasons?)
IMS 6217: Distributed Databases
3Dr. Lawrence West, Management Dept., University of Central [email protected]
Distributed Databases—Introduction (cont.)
• A DDB runs on more than one computer (duhhhh)
– All or part of the data is distributed
– Servers may have unique data or copies of data also on other servers
– Some data may be on clients (not servers) for strictly local use
– Servers may be right next to each other (same room or even rack) or around the world
– DDB may share a strong schema (design) that applies to all instances or may have a logical schema that integrates disparate autonomous DB
IMS 6217: Distributed Databases
4Dr. Lawrence West, Management Dept., University of Central [email protected]
Distributed Databases—Introduction (cont.)
• Reasons for distributing data
– Workload distribution for scalability
– Getting data closer to users in a distributed organization
– Autonomous operating elements of a diverse organization have different data needs
• Reporting needs to central HQ
• Coordination needs
IMS 6217: Distributed Databases
5Dr. Lawrence West, Management Dept., University of Central [email protected]
Distributed Databases—Introduction (cont.)
• Three main types/reasons of/for distributing data
1. Diverse distributed organizational divisions have unique data needs with some requirements for centralized coordination and/or reporting
2. Homogeneous distributed organization requires geographically available data but strong needs for centralized management and control
3. Homogeneous organization requires workload distribution to support scalability
• Any combination of these circumstances can exist
IMS 6217: Distributed Databases
6Dr. Lawrence West, Management Dept., University of Central [email protected]
Exercises
• Three Distributed System Design Exercises
– General Electric
– J.C. Penney
– UPS
• Consider
– Corporate control
– Transaction processing
– Reporting
– Data integrity
– Data warehousing
IMS 6217: Distributed Databases
7Dr. Lawrence West, Management Dept., University of Central [email protected]
Exercise #1—The General Electric Case
• Consider the GE product line: Light bulbs, large and small consumer appliances, power generation equipment, jet engines, financial services, railroad locomotives, healthcare equipment http://www.ge.com/products_services/index.html
• Assume that each division has the databases and systems needed for its internal operations
1. What are examples of data needs for GE HQ and what technical capabilities must exist to satisfy the needs?
2. Are there any needs for cross-division data communication? If so, identify technical needs.
IMS 6217: Distributed Databases
8Dr. Lawrence West, Management Dept., University of Central [email protected]
Exercise #2—The J.C. Penney Case
• Consider the operations of the J.C. Penney retail department stores only. More than 1,000 stores throughout US & Puerto Rico.
1. What are examples of data needs for JCP HQ and what technical capabilities must exist to satisfy the needs?
2. Are there unique local needs?
IMS 6217: Distributed Databases
9Dr. Lawrence West, Management Dept., University of Central [email protected]
Exercise #3—The UPS Case
• Consider UPS customer and package tracking systems characterized by millions of daily transactions, package tracking as a competitive tool, worldwide locations, and disconnected (from the network) transactions at the driver delivery transaction.
1. What transactions (if any) are local?
2. What special data sharing must take place?
3. What is the implication of the disconnected drivers?
IMS 6217: Distributed Databases
10Dr. Lawrence West, Management Dept., University of Central [email protected]
Integrating Autonomous Systems
• Consider the GE example with additional conditions
– Diverse nodes have different DBMS software
– Similar tables in different nodes have different structures
• Different column structures
• Different data types for same/similar columns
– Different data integrity requirements
• Will most cross-division DB events be:
– Queries/retrievals?
– Inserts/updates?
IMS 6217: Distributed Databases
11Dr. Lawrence West, Management Dept., University of Central [email protected]
Integrating Autonomous Systems (cont.)
• Data Warehousing—the simple solution
• When corporate needs can be satisfied with a DW solution we can use existing tools
– Ideally suited for disparate data sources
– Transforming disparate data
• Decisions include
– Frequency of updates
– Granularity
– Dimension tables applicable to unique division data
IMS 6217: Distributed Databases
12Dr. Lawrence West, Management Dept., University of Central [email protected]