Top Banner
© 2010 Wipro Ltd - Confidential Teradata Overview Part-1 Amruta Patil
35
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential

Teradata Overview

Part-1

Amruta Patil

Page 2: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential2 © 2010 Wipro Ltd - Confidential2

End of the session you will have gained an understanding on following

§ Teradata overview.

Objective of the course

Page 3: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential3 © 2010 Wipro Ltd - Confidential3

Agenda

3

4 Teradata Never Have to Do

1 What is Teradata ?

2 Teradata Architecture

Teradata Advantages

5 Summary

Page 4: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential4

What is Teradata ?

Page 5: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential5 © 2010 Wipro Ltd - Confidential5

Teradata is a relational database management system (RDBMS) that is:

§ An open system, running on a UNIX MP-RAS or Windows 2000 server platform.

§ Capable of supporting many concurrent users from various client platforms (over a TCP/IP or IBM channel connection).

§ Built on a parallel architecture.§ “Share Nothing” architecture.§ Linear Scalability.§ Automatic, even data distribution

What is Teradata?

Page 6: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential6

Teradata Architecture

Page 7: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential7 © 2010 Wipro Ltd - Confidential7

Teradata Architecture

Page 8: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential8

Teradata Advantages

Page 9: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential9 © 2010 Wipro Ltd - Confidential9

§ Large capacity database machine: Teradata handles the large data storage requirements to process the large amounts of detail data for decision support.

· Billions of rows· Terabytes of data

§ Performance: Early relational systems suffered severe performancelimitations as table size increased. Teradata addresses theperformance issues of large databases.

§ Single data store for multiple clients: Instead of replicating adatabase for different hosts, with Teradata you store it once and use it for all clients. This is what's known as "a single version of the truth".

§ Connectivity: Teradata connects easily to network-attached hostsystems as well as mainframe hosts.

Teradata Advantages

Page 10: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential10 © 2010 Wipro Ltd - Confidential10

§ Standard access language (SQL): SQL has been adapted as theindustry standard for relational databases.

§ Manageable growth: Teradata systems are linearly expandable toallow for growth without performance drop-off.

§ Fault tolerance: Teradata can automatically detect and recover from one or more hardware failures.

§ Data integrity: To guarantee the integrity of the data, transactions are either completed or, if a fault occurs, rolled back.

Teradata Advantages contd..

Page 11: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential11

Teradata Never Have to Do

Page 12: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential12 © 2010 Wipro Ltd - Confidential12

Teradata DBAs are not required to do the following :

§ Reorganize data or index space.§ Pre-allocate table/index space and format partitioning. While it is

possible to have partitioned indexes in Teradata, they are notrequired.

§ Pre-prepare data for loading (convert, sort, split, etc.).§ Ensure that queries run in parallel.§ Unload/reload data spaces due to expansion. With Teradata, the

data can be redistributed on the larger configuration with nounloading and reloading required.

Teradata Never Have to Do

Page 13: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential13

Summary

Page 14: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential14 © 2010 Wipro Ltd - Confidential14

§ Teradata is designed and used for enterprise data warehousing.§ Designed and built with parallelism from day one……

“The secret to data warehousing is not just the ability to lay down aterabyte of data onto disk. Lots of vendors can do that. The trick is to be able to access it and use it in a meaningful way. That means lots of users doing lots of queries, some ad hoc, some planned, all getting what they need in a timely fashion.”

Summary

Page 15: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential

Teradata Overview

Part-2

Amruta Patil

Page 16: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential16 © 2010 Wipro Ltd - Confidential16

End of the session you will have gained an understanding on following

§ Teradata overview – Part 2.

Objective of the course

Page 17: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential17 © 2010 Wipro Ltd - Confidential17

Agenda

3

4 Major Components of Teradata system

1 Describe a node

2 SMP and MMP

Clique

5 Summary

Page 18: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential18

Describe a node.

Page 19: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential19 © 2010 Wipro Ltd - Confidential19

Describe a node.

Page 20: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential20

SMP and MPP

Page 21: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential21 © 2010 Wipro Ltd - Confidential21

A node is made up of various hardware and software components. All applications run under UNIX , or Windows, and all Teradata software runs under PDE. All share the resources of CPU and memory on the node.

§ Teradata software, the LAN gateway, and channel-driver software run as processes.

§ AMPs and PEs are virtual processors (vprocs) which run under Parallel Database Extensions (PDE).

§ AMPs are associated with virtual disks (vdisks).§ A single node is called a Symmetric Multi-Processor (SMP).

Symmetric Multi-Processor (SMP)

Page 22: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential22 © 2010 Wipro Ltd - Confidential22

Massively Parallel Processing

Page 23: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential23 © 2010 Wipro Ltd - Confidential23

When multiple SMP nodes are connected to form a larger configuration,we refer to this as a Massively Parallel Processing (MPP) system.

§ Multiple nodes are configured into a Massively Parallel Processing (MPP) system.

§ A physical message-passing layer called the BYNET is used to interconnect multiple nodes.

§ Teradata is linearly expandable—as your database grows, additional nodes may be added.

§ The BYNET can support 512 nodes.

Massively Parallel Processing contd…

Page 24: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential24

Clique

Page 25: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential25 © 2010 Wipro Ltd - Confidential25

Clique

Page 26: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential26 © 2010 Wipro Ltd - Confidential26

A clique is a set of Teradata nodes that share a common set of disk arrays. Cabling a subset of nodes to the same disk arrays creates a clique.

§ A clique is a defined set of nodes with failover capability.§ All nodes in a clique are able to access the vdisks of all AMPs in the clique.§ If a node fails, its vprocs will migrate to the remaining nodes

in the clique.§ Each node can support 128 vprocs

Clique contd…

Page 27: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential27

Major components of Teradata system

Page 28: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential28 © 2010 Wipro Ltd - Confidential28

§ Parsing Engine (PE) : The Parsing Engine interprets the SQL command and converts the data record from the host into an AMP message– Managing Individual sessions (up to 120)– Parsing and optimizing SQL requests– Dispatching the optimized plan to the AMPs– Sending the answer set response back to the requesting client

Major Components of a Teradata System

Page 29: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential29 © 2010 Wipro Ltd - Confidential29

§ AMP : The AMP is a virtual processor (vproc) designed for and dedicated to managing a portion of the entire database. It performs all database management functions such as sorting, aggregating, and formatting data. The AMP receives data from the PE, formats rows, and distributes them to the disk storage units it controls. The AMP also retrieves the rows requested by the Parsing Engine.– Storing and retrieving rows to and from the disks– Lock Management– Sorting rows and aggregating columns– Join Processing– Output conversions and formatting

Major Components of a Teradata System contd…

Page 30: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential30 © 2010 Wipro Ltd - Confidential30

§ BYNET : The BYNET is the message-passing layer. It determines which AMP(s) (Access Module Processor) should receive a message. – Point-to-point communications between nodes and virtual processors– Merging answer sets back to the PE– making Teradata parallelism possible

Major Components of a Teradata System contd…

Page 31: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential31

Summary

Page 32: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential32 © 2010 Wipro Ltd - Confidential32

§ SMP and MMP.§ Components of Teradata System.§ Clique

Summary

Page 33: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential33 © 2010 Wipro Ltd - Confidential33

Q&A

Page 34: Teradata Part 1 and Part - 2

© 2010 Wipro Ltd - Confidential

Thank You

Amruta Patil

ADM Support

[email protected]

Page 35: Teradata Part 1 and Part - 2

© 2009 Wipro Ltd - Confidential35 © 2010 Wipro Ltd - Confidential35

v http://www.teradata.com/

References