Utility-Oriented Cloud & Grid Computing: A Vision, Hype, and Reality Dr. Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS) Lab Dept. of Computer Science and Software Engineering The University of Melbourne, Australia www.gridbus.org www.buyya.com www.manjrasoft.com Gridbus Sponsors M anjra soft D rR ajkum arB uyya C hief E xecutive Officer M anjrasoftP ty Ltd R oom 5.31,IC T B uilding,111,Barry Street,C arlton, M elbourne,VIC 3053,A ustralia P:+61-3-8344 1344 | F :+61-3-9348 1184 E:[email protected]http://www.manjrasoft.com M an jra soft
56
Embed
Utility-Oriented Cloud & Grid Computing: A Vision, Hype, and Reality Dr. Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS) Lab Dept. of Computer.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Utility-Oriented Cloud & Grid Computing: A Vision, Hype, and Reality
Dr. Rajkumar BuyyaGrid Computing and Distributed Systems (GRIDS) LabDept. of Computer Science and Software EngineeringThe University of Melbourne, Australiawww.gridbus.orgwww.buyya.comwww.manjrasoft.com
Gridbus Sponsors
ManjrasoftDr R ajkumar B uyya
C hief E xecutive O fficer
Manjrasoft P ty L tdR oom 5.31, IC T B uilding, 111, B arry S treet, C arlton,
Youngest and one of the rapidly growing research labs in our School/University:
Founded in 2002 Houses 20+ researchers consisting of:
Research Fellows/PostDocs Software Engineers PhD candidates Honours/Masters students
Funding National and International organizations Australian Research Council & DEST Many industries (Sun, StorageTek, Microsoft,
IBM, Microsoft) University-wide collaboration:
Faculties of Science, Engineering, and Medicine
Many national and international collaborations.
Academics Industries
Software: Widely in academic and industrial users.
Publication: My research team produces over 20% of our
Dept’s research output.
EducationR & D
+ Community Services: e.g., IEEE TC for Scalable Computing
Manjrasoft
3
Agenda
Introduction Utility Networks and Grid Computing Application Drivers and Various Types of Grid Services
Global Grids and Challenges Security, resource management, pricing models, …
Service-Oriented Grid Architecture and Gridbus Solutions
Market-based Management, GMD, Grid Bank, Aneka Grid Service Broker
Architecture, Design and Implementation Performance Evaluation: Experiments in Creation
and Deployment of Applications on Global Grids A Case Study in High Energy Physics
Summary and Conclusion
4
“Computer Utilities” Vision: Implications of the Internet
1969 – Leonard Kleinrock, ARPANET project “As of now, computer networks are still in their infancy,
but as they grow up and become sophisticated, we will probably see the spread of ‘computer utilities’, which, like present electric and telephone utilities, will service individual homes and offices across the country”
Computers Redefined 1984 – John Gage, Sun Microsystems
“The network is the computer” 2008 – David Patterson, U. C. Berkeley
“The data center is the computer. There are dramatic differences between of developing software for millions to use as a service versus distributing software for millions to run their PCs”
2008 – “Cloud is the computer” – Buyya!
5
Computing Paradigms and Attributes: Realizing the ‘Computer Utilities’
Vision Web Data Centres Utility Computing Service Computing Grid Computing P2P Computing Market-Oriented
* Since Grids have been around for sometime (early 2000), do we have a unified vision of what Grids can do?
* And did we make sufficient advances to turn vision of “computer utilities” into a
reality?
-- Let us take a look at views of -“industrial” practitioners & “academics”
7
“Industrial” vision of Grid computing
IBM On Demand Computing
Microsoft .NET
Oracle 10g
Sun N1 – Sun Grid Engine
HP Adaptive Enterprise
Amazon Elastic Compute Cloud Services
Manjrasoft Aneka for building enterprise Grids and Clouds.
8
Most academics view: Cyberinfrastructure for conducting
collaborative (e-)Science
Distributed instruments
Distributed computation
Distributed data
Peers sharing ideas and collaborative interpretation of data/results
2100210021002100
2100210021002100
Remote Visualization
Data & Compute Service
Cyberinfrastructure
E-Scientist
9
How do Grids look like?A Bird Eye View of a Global Grid
Grid Resource Broker
Resource Broker
Application
Grid Information Service
Grid Resource Broker
databaseR2R3
RN
R1
R4
R5
R6
Grid Information Service
10
How do Grids look like?A Bird Eye View of a Global Grid
Grid Resource Broker
Resource Broker
Application
Grid Information Service
Grid Resource Broker
databaseR2R3
RN
R1
R4
R5
R6
Grid Information Service
11
How Are Grids Used?
High-performance computing
Collaborative data-sharingCollaborative design
Drug discovery
Financial modeling
Data center automation
High-energy physics
Life sciences
E-Business
E-ScienceNatural language processing
Utility computing
Business Intelligence(Data Mining)
12
Agenda
Introduction Utility Networks and Grid Computing Application Drivers and Various Types of Grid Services
Global Grids and Challenges Security, resource management, pricing models, …
Service-Oriented Grid Architecture and Gridbus Solutions
Market-based Management, GMD, Grid Bank, Aneka Grid Service Broker
Architecture, Design and Implementation Performance Evaluation: Experiments in Creation
and Deployment of Applications on Global Grids A Case Study in High Energy Physics
Summary and Conclusion
13
Grid Challenges
Security
Resource Allocation & Scheduling
Data locality
Network Management
System Management
Resource Discovery
Uniform Access
Computational Economy
Application Construction
14
Some Grid Initiatives Worldwide
Australia Nimrod-G Gridbus DISCWorld GrangeNet. APACGrid ARC eResearch
Brazil OurGrid, EasyGrid LNCC-Grid + many others
China ChinaGrid – Education CNGrid - application
Europe UK eScience EU Grids.. and many more...
India Garuda
Japan NAREGI
Korea...N*Grid
SingaporeNGP
USA Globus TeraGrid Cyberinfrasture AutoMate and many more...
Industry Initiatives IBM On Demand Computing HP Adaptive Computing Sun N1 Microsoft - .NET Oracle 10g Amzon – Elastic Compute Cloud Infosys – Enterprise Grid Satyam – Business Grid Manjrasoft – enterprise Clouds
and Grids and many more
Public Forums Open Grid Forum Conferences:
CCGrid Grid HPDC E-Science
http://www.gridcomputing.com
1.3 billion – 3 yrs
1 billion – 5 yrs
450million – 5 yrs
486million – 5 yrs
1.3 billion (Rs)
27 million
2? billion
120million – 5 yrs
15
Open-Source Grid Middleware Projects
Slide by Hiro
16
Driving Theme:Community vs. Utility Grids
Type
Feature
Community Grids Utility Grids (Now Clouds)
User QoS Best effort Contract/SLA
Service Pricing
Not considered /
free access
Usage, QoS level, Market supply and demand
Example Middleware
Globus, Condor, OMII, Unicore
Nimrod-G, Gridbus, & many inspired efforts (IBM Business Grid, Sun Grid Market)
.. Amazon EC2..
17
The Gridbus Project @ Melbourne:Enable Leasing of ICT Services on Demand
WWG
Pushes Grid computing into mainstream
computing
Gridbus
18
The Gridbus Project @ GRIDS Lab, The University of Melbourne: Toolkit for Creating and Deploying e-* Applications on Utility Grids
The Gridbus Project @ GRIDS Lab, The University of Melbourne: The Gridbus Project @ GRIDS Lab, The University of Melbourne: Toolkit for Creating and Deploying Toolkit for Creating and Deploying ee--** Applications on Utility GridsApplications on Utility Grids
Gridbus
Distributed Data
http://www.gridbus.org
• Gridbus is a “open source” Grid R&D project with focus on Grid Economy, Utility Grids and Service Oriented Computing.
Introduction Utility Networks and Grid Computing Application Drivers and Various Types of Grid Services
Global Grids and Challenges Security, resource management, pricing models, …
Service-Oriented Grid Architecture and Gridbus Solutions
Market-based Management, GMD, Grid Bank, Aneka Grid Service Broker
Architecture, Design and Implementation Performance Evaluation: Experiments in Creation
and Deployment of Applications on Global Grids A Case Study in High Energy Physics
Summary and Conclusion
20
What do Grid players want & require?
Grid Service Consumers (GSCs): - minimize expenses, meet QoS How do I express QoS requirements ? How do I trade between timeframe & cost ? How do I discover services and map jobs to meet my QoS needs? How do I manage Grid dynamics and get my work done? …
Grid Service Providers (GSPs):– maximise ROI How do I decide service pricing models ? How do I specify them ? How do I translate them into resource allocations ? How do I enforce them ? How do I advertise & attract consumers ? How do I do accounting and handle payments? …
They need mechanisms, tools and technologies that help them in value expression, value translation, and value enforcement.
21
Grid Node N
Service-Oriented Grid Architecture
Grid Service Consumer
Pro
gra
mm
ing
En
viro
nm
ents
Grid Resource Broker
Grid Explorer
Schedule Advisor
Trade Manager
Job ControlAgent
Deployment Agent
Information Service
Pricing Algorithms
Grid Node1
…
Core Middleware Services
…
…
HealthMonitor
Grid Market Services
JobExec
Info ?
Secure
Trading
QoS
Storage
Sign-on
Grid Bank
Ap
pli
cati
on
s
Data Catalogue
Grid Service Providers
Trade Server
Resource Allocation
ResourceReservation
R1
Misc. services
R2 Rm…
Accounting
22
Market-Oriented Grid Software: A union of Gridbus and other
Divide the problem in to multiple small tasks and distribute them run in parallel on multiple computers within a Cloud.
33
User scenario: GoFront(unit of China Southern Railway Group)
Aneka utilizes idle desktops (30) to decrease task time
from days to hours
Time (in hrs)
Single Server
Aneka Cloud
Raw Locomotive Design Files(Using AutoDesk Maya) Using Maya
Graphical Mode Directly
Case 1: Single Server
4 cores server
Aneka Maya Renderer
Use private Aneka Cloud
GoFront Private Aneka Cloud
LAN network (Running Maya Batch Mode on
demand)
Case 2: Aneka Enterprise Cloud Manjrasoft
Application: Locomotive design CAD rendering
34
Aneka: How can get it?
Available to Download: Software: www.manjrasoft.com Manual: Setting up Cloud using your LAN-network computers
Teaching material parallel and distributed computing and programming, List of possible assignments for students Possible Projects for Final year students..
Price – highly affordable = Fee you charge to 1 student (each year) and all
students/teachers in entire college/university can use it! Applications
Other Departments (Physics, Chemistry, Biology, Finance, Engineering) can use it for their applications.
35
ASP Catalogue
Grid Info Service
Grid Market DirectoryASP Catalogue
Grid Info Service
Grid Market Directory
GSP(Accounting Service)
GridbusGridBank
GSP(Accounting Service)
GridbusGridBank
GSP(e.g., Microsoft)
PEGSP
(e.g., Amazon)
PE
GSP(e.g., IBM)
CPUorPE
Grid Service (GS)(Globus) Aneka
EC2
GTS
Resource Allocation
GSP(e.g., Microsoft)
PEGSP
(e.g., Amazon)
PE
GSP(e.g., IBM)
CPUorPE
Grid Service (GS)(Globus) Aneka
EC2
GTS
Resource Allocation
Job
8
Job
Job
Job
8
GridResource Broker
2
GridResource Broker
22
Visual Application Composer
Application CodeExplore
data1
Visual Application Composer
Application CodeExplore
data1
3366
4455
Res
ults
9
Res
ults
Res
ults
9 77
Results+
Cost Info
10
Results+
Cost Info
10
1111
Bill
12
BillBill
12Data CatalogueData Catalogue
On Demand Assembly of Services in Market-Oriented Grid Environments
36
Agenda
Introduction Utility Networks and Grid Computing Application Drivers and Various Types of Grid Services
Global Grids and Challenges Security, resource management, pricing models, …
Service-Oriented Grid Architecture and Gridbus Solutions
Market-based Management, GMD, Grid Bank, Aneka Grid Service Broker
Architecture, Design and Implementation Performance Evaluation: Experiments in Creation
and Deployment of Applications on Global Grids A Case Study in High Energy Physics
Summary and Conclusion
37
A resource broker for scheduling task farming data Grid applications with static or dynamic parameter sweeps on global Grids.
It uses computational economy paradigm for optimal selection of computational and data services depending on their quality, cost, and availability, and users’ QoS requirements (deadline, budget, & T/C optimisation)
Key Features A single window to manage & control experiment Programmable Task Farming Engine Resource Discovery and Resource Trading Optimal Data Source Discovery Scheduling & Predications Generic Dispatcher & Grid Agents Transportation of data & sharing of results Accounting
Grid Service Broker (GSB)
38
Core Middleware
Gridbus User Console/Portal/Application Interface
Grid Info Server
Schedule Advisor
Trading Manager
Gridbus Farming Engine
RecordKeeper
Grid Explorer
GE GIS, NWSTM TS
RM & TS
Grid Dispatcher
G
G
CU
Globus enabled node.
AL
DataCatalog
DataNode
Amazon EC2/S3 Cloud.
$
$
$
App, T, $, Optimization Preference
workload
Gridbus Broker
39
Gridbus Broker: Separating “applications” from “different” remote service access
enablers and schedulers
Aneka
AMI
Amazon EC2Data Store
Access Technology
Grid FTPSRB
-PBS-Condor-SGE
Globus
Job manager
fork() batch()
Gridbusagent
Data Catalog
-PBS-Condor-SGE-XGrid
SSH
fork()
batch()
Gridbusagent
Single-sign on securityHome Node/Portal
GridbusBroker
fork()
batch() -PBS-Condor-SGE-Aneka-XGrid
Application Development Interface
Sch
ed
ulin
gIn
terfa
ces
Alogorithm1
AlogorithmN
Plugin Actuators
40
Gridbus Services for eScience applications
Application Development Environment: XML-based language for composition of task farming (legacy)
applications as parameter sweep applications. Task Farming APIs for new applications. Web APIs (e.g., Portlets) for Grid portal development. Threads-based Programming Interface Workflow interface and Gridbus-enabled workflow engine. … Grid Superscalar – in cooperation with BSC/UPC
Resource Allocation and Scheduling Dynamic discovery of optional computational and data nodes that
meet user QoS requirements. Hide Low-Level Grid Middleware interfaces
Globus (v2, v4), SRB, Aneka, Unicore, and ssh-based access to local/remote resources managed by XGrid, PBS, Condor, SGE.
41
Drug DesignMade Easy!
Click Here for Demo
42
s
A Sample List of Gridbus Broker UsersA Sample List of Gridbus Broker UsersA Sample List of Gridbus Broker Users
http://www.gridbus.org
Molecular docking for drug design on Australian National Grid
Molecular docking for drug design on Australian National Grid
High Energy Physics: Particle Discovery
High Energy Physics: Particle Discovery
Melbourne University
NeuroScience: Brain Activity Analysis
NeuroScience: Brain Activity Analysis
EU Data Mining GridEU Data Mining Grid
DaimlerChrysler, Technion, U. Ljubljana, U. Ulster
Kidney/Human Physiome Modelling
Kidney/Human Physiome Modelling
Melbourne Medical Faculty, Université d'Evry, France
Introduction Utility Networks and Grid Computing Application Drivers and Various Types of Grid Services
Global Grids and Challenges Security, resource management, pricing models, …
Service-Oriented Grid Architecture and Gridbus Solutions
Market-based Management, GMD, Grid Bank, Aneka Grid Service Broker
Architecture, Design and Implementation Performance Evaluation: Experiments in Creation
and Deployment of Applications on Global Grids A Case Study in High Energy Physics
Summary and Conclusion
44
Case Study: High Energy Physics and Data Grid
The Belle Experiment KEK B-Factory, Japan Investigating fundamental violation
of symmetry in nature (Charge Parity) which may help explain “why do we have more antimatter in the universe OR imbalance of matter and antimatter in the universe?”.
Collaboration 1000 people, 50 institutes
100’s TB data currently
45
Case Study: Event Simulation and Analysis
B0->D*+D*-Ks
• Simulation and Analysis Package - Belle Analysis Software Framework (BASF)• Experiment in 2 parts – Generation of Simulated Data and Analysis of the distributed data
Analyzed 100 data files (30MB each) that were distributed among the five nodes within Australian Belle DataGrid platform.
46
Australian Belle Data Grid Testbed
Grid Service Broker
Replica Catalog
AARNET
NWS NameServer
VirtualOrganization
Analysis Request
Analysis Results
CertificateAuthority
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
GRIDS Lab, University of Melbourne
Dept. of Physics,University of Sydney
ANU, Canberra
Dept. of Computer Science, University of Adelaide
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Intel Pentium 2.0 Ghz, 512 MB RAM
Dept. of Physics,University of Melbourne
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
VPACMelbourne
47
Belle Data Grid (GSP CPU Service Price: G$/sec)
Grid Service Broker
Replica Catalog
AARNET
NWS NameServer
VirtualOrganization
Analysis Request
Analysis Results
CertificateAuthority
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
GRIDS Lab, University of Melbourne
Dept. of Physics,University of Sydney
ANU, Canberra
Dept. of Computer Science, University of Adelaide
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Intel Pentium 2.0 Ghz, 512 MB RAM
Dept. of Physics,University of Melbourne
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
NA
G$4
G$4
Datanode
G$6VPAC
MelbourneG$2
48
Belle Data Grid (Bandwidth Price: G$/MB)
Grid Service Broker
Replica Catalog
AARNET
NWS NameServer
VirtualOrganization
Analysis Request
Analysis Results
CertificateAuthority
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
GRIDS Lab, University of Melbourne
Dept. of Physics,University of Sydney
ANU, Canberra
Dept. of Computer Science, University of Adelaide
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Intel Pentium 2.0 Ghz, 512 MB RAM
Dept. of Physics,University of Melbourne
NWSSensor
GridFTPGRIS
GlobusGatekeeper
Dual Intel Xeon 2.8 Ghz, 2 GB RAM
NA
G$4
G$4
Datanode
G$6VPAC
MelbourneG$2
34
31
38
31
30
3336
32
49
Deploying Application Scenario
A data grid scenario with 100 jobs and each accessing remote data of ~30MB