Top Banner
Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu Clouds: An Opportunity for Scientific Applications? Ewa Deelman USC Information Sciences Institute
38

Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

May 28, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Clouds: An Opportunity for Scientific Applications?

Ewa DeelmanUSC Information Sciences Institute

Page 2: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

AcknowledgementsYang-Suk Ki (former PostDoc, USC)Gurmeet Singh (former Ph.D. student, USC)Gideon Juve (Ph.D. student, USC)Tina Hoffa (Undergrad, Indiana University)Miron Livny (University of Wisconsin, Madison)Montage scientists: Bruce Berriman, John Good, and othersPegasus team: Gaurang Mehta, Karan Vahi, others

Page 3: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

OutlineBackground

Science ApplicationsWorkflow Systems

The opportunity of the CloudVirtualizationOn-demand availability

Simulation study of an astronomy application on the CloudConclusions

Page 4: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Ewa [email protected]

Scientific Applications

ComplexInvolve many computational stepsRequire many (possibly diverse resources)Often require a custom execution environment

Composed of individual application componentsComponents written by different individualsComponents require and generate large amounts of dataComponents written in different languages

Page 5: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Issues Critical to Scientists

Reproducibility of scientific analyses and processes is at the core of the scientific methodScientists consider the “capture and generation of provenance information as a critical part of the <…> generated data”“Sharing <methods> is an essential element of education, and acceleration of knowledge dissemination.”

NSF Workshop on the Challenges of Scientific Workflows, 2006, www.isi.edu/nsf-workflows06Y. Gil, E. Deelman et al, Examining the Challenges of Scientific Workflows. IEEE Computer, 12/2007

Page 6: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Computational challenges faced by applications

Be able to compose complex applications from smaller componentsExecute the computations reliably and efficientlyTake advantage of any number/types of resourcesCost is an issue

Cluster, Shared CyberInfrastructure (EGEE, Open Science Grid, TeraGrid), Cloud

Page 7: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Possible solution

Structure an application as a workflowDescribe data and components in logical termsCan be mapped onto a number of execution environmentsCan be optimized and if faults occur the workflow management system can recover

Use a workflow management system (Pegasus-WMS) to manage the application on a number of resources

Page 8: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Pegasus-Workflow Management System

Leverages abstraction for workflow description to obtain ease of use, scalability, and portabilityProvides a compiler to map from high-level descriptions to executable workflows

Correct mappingPerformance enhanced mapping

Provides a runtime engine to carry out the instructions (Condor DAGMan)

Scalable mannerReliable manner

Can execute on a number of resources: local machine, campus cluster, Grid, Cloud

Page 9: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Mapping Correctly

Select where to run the computationsApply a scheduling algorithm for computation tasksTransform task nodes into nodes with executable descriptions

Execution locationEnvironment variables initializesAppropriate command-line parameters set

Select which data to accessAdd stage-in nodes to move data to computationsAdd stage-out nodes to transfer data out of remote sites to storageAdd data transfer nodes between computation nodes that execute on different resources

Page 10: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Additional Mapping Elements

Add data cleanup nodes to remove data from remote sites when no longer needed

reduces workflow data footprint Cluster compute nodes in small computational granularity applicationsAdd nodes that register the newly-created data productsProvide provenance capture steps

Information about source of data, executables invoked, environment variables, parameters, machines used, performance

Scale matters--today we can handle:1 million tasks in the workflow instance (SCEC)10TB input data (LIGO)

Page 11: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Science-grade Mosaic of the Sky

Image Courtesy of IPAC, Caltech

Point on the sky, area

Page 12: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu*The full moon is 0.5 deg. sq. when viewed form Earth, Full Sky is ~ 400,000 deg. sq.

Generating mosaics of the sky (Bruce Berriman, Caltech)

20,652

8,586

4,856

1,444

232

Number of jobs

3,722

1,444

747

212

53

Number of input data files

97GB

38GB

20GB

5.5GB

1.2GB

Total data footprint

2 hrs. 14 mins22,8506

1hr 46 mins13,0614

49 mins3,9062

6 hours54,43410

40 mins5881

Approx. execution time (20 procs)

Number of Intermediate files

Size of the mosaic is degrees square*

Page 13: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Types of Workflow Applications

Providing a service to a community (Montage project)Data and derived data products available to a broad range of usersA limited number of small computational requests can be handled locallyFor large numbers of requests or large requests need to rely on shared cyberinfrastructure resourcesOn-the fly workflow generation, portable workflow definition

Supporting community-based analysis (SCEC project)Codes are collaboratively developed Codes are “strung” together to model complex systemsAbility to correctly connect components, scalability

Processing large amounts of shared data on shared resources(LIGO project)

Data captured by various instruments and cataloged in community data registries. Amounts of data necessitate reaching out beyond local clusters Automation, scalability and reliability

Automating the work of one scientist (Epigenomic project, USC)Data collected in a lab needs to be analyzed in several stepsAutomation, efficiency, and flexibility (scripts age and are difficult to change)Need to have a record of how data was produced

Page 14: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

OutlineBackground

Science ApplicationsWorkflow Systems

The opportunity of the CloudVirtualizationAvailability

Simulation study of an astronomy application on the CloudConclusions

Page 15: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

CloudsOriginated in the business domainOutsourcing services to the CloudPay for what you useProvided by data centers that are built on compute and storage virtualization technologies. Scientific applications often have different requirements

MPIShared file systemSupport for many dependent jobs

Container-based Data Center

Page 16: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Available Cloud Platforms

Commercial ProvidersAmazon EC2, Google, others

Science CloudsNimbus (U. Chicago), Stratus (U. Florida)Experimental

Roll out your own using open source cloud management software

Virtual Workspaces (Argonne), Eucalyptus (UCSB), OpenNebula (C.U. Madrid)

Many more to come

Page 17: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Cloud Benefits for Grid ApplicationsSimilar to the Grid

Provides access to shared cyberinfrastructureCan recreate familiar grid and cluster architectures (with additional tools)Can use existing grid software and tools

Resource ProvisioningResources can be leased for entire application instead of individual jobsEnables more efficient execution of workflows

Customized Execution EnvironmentsUser specifies all software components including OSAdministration performed by user instead of resource provider (good [user control] and bad [extra work])

Page 18: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Amazon EC2 Virtualization

Virtual NodesYou can request a certain class of machinePrevious research suggests 10% performance hitMultiple virtual hosts on a single physical hostYou have to communicate over a wide-area network

Virtual Clusters (additional software needed)Create cluster out of virtual resourcesUse any resource manager (PBS, SGE, Condor)Dynamic configuration is the key issue

Page 19: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Personal Cluster

GT4/PBS

Batch

Resources

Com

pute C

louds

Private Queue

System Queue

No Job manager

Resource & execution environment

Private Cluster on Demand

Work by Yang-Suk Kee at USC

Can set up NFS, MPI, ssh

Page 20: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

EC2 Software EnvironmentSpecified using disk images

OS snapshot that can be started on virtualized hostsProvides portable execution environment for applicationsHelps with reproducibility for scientific applications

Images for a workflow application can contain:Application CodesWorkflow Tools

Pegasus, DAGMan

Grid ToolsGlobus Gatekeeper, GridFTP

Resource ManagerCondor, PBS, SGE, etc.

Page 21: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

EC2 Storage OptionsLocal Storage

Each EC2 node has 100-300 GB of local storageUsed for image too

Amazon S3Simple put/get/delete operations Currently no interface to grid/workflow software

Amazon EBSNetwork accessible block-based storage volumes (c.f. SAN)Cannot be mounted on multiple workers

NFSDedicated node exports local storage, other nodes mount

Parallel File Systems (Lustre, PVFS, HDFS)Combine local storage into a single, parallel file systemDynamic configuration may be difficult

Page 22: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Montage/IPAC SituationProvides a service to the community

Delivers data to the communityDelivers a service to the community (mosaics)

Have their own computing infrastructureInvests ~ $75K for computing (over 3 years)Appropriates ~ $50K in human resources every year

Expects to need additional resources to deliver servicesWants fast responses to user requests

Page 23: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Cloudy QuestionsApplications are asking:

What are Clouds? How do I run on them?

How do I make good use of the cloud so that I use my funds wisely?

And how do I explain Cloud computing to the purchasing people?

How many resources do I allocate for my computation or my service?How do I manage data transfer in my cloud applications?How do I manage data storage—where do I store the input and output data?

Page 24: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

OutlineBackground

Science ApplicationsWorkflow Systems

The opportunity of the CloudVirtualizationAvailability

Simulation study of an astronomy application on the CloudConclusions

Page 25: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Montage Infrastructure

Page 26: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Montage Infrastructure

Page 27: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Computational ModelBased on Amazon’s fee structure

$0.15 per GB-Month for storage resources$0.1 per GB for transferring data into its storage system$0.16 per GB for transferring data out of its storage system$0.1 per CPU-hour for the use of its compute resources

Normalized to cost per secondDoes not include the cost of building and deploying an imageSimulations done using a modified Gridsim

Page 28: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

How many resources to provision?

Montage 1 Degree Workflow 203 Tasks60 cents for the 1 processor computation versus almost $4 with 128 processors, 5.5 hours versus 18 minutes

Page 29: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

4 Degree Montage

3,027 application tasks1 processor $9, 85 hours; 128 processors, 1 hour with and $14.

Page 30: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Data Management Modes

Remote I/O

Regular

Cleanup

0

1

2

a

b

b

c

0

1

2

Ra

Rb

Rb

Wb

Good for non-shared file systems

WcRc

1.25GB versus 4.5 GB

Page 31: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

How to manage data?

1 Degree Montage 4 Degree Montage

Page 32: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

How do data cost affect total cost?

Data stored outside the cloudComputations run at full parallelismPaying only for what you use

Assume you have enough requests to make use of all provisioned resources

Cost in $

Page 33: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Where to keep the data?Storing all of 2 Mass data

12 TB of data $1,800 per month on the CloudCalculating a 1 degree mosaic and delivering it to the user $2.22 (with data outside the cloud)Same mosaic but data inside the cloud: $2.12To overcome the storage costs, users would need to request at least $1,800/($2.22-$2.12) = 18,000 mosaics per month Does not include the initial cost of transferring the data to the cloud, which would be an additional $1,200 Is $1,800 per month reasonable?

~$65K over 3 years (does not include data access costs from outside the cloud)Cost of 12TB to be hosted at Caltech $15K over 3 years for hardware

Page 34: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

The cost of doing scienceComputing a mosaic of the entire sky (3,900 4-degree-square mosaics)

3,900 x $8.88 = $34,632 How long it makes sense to store a mosaic?

Storage vs computation costs

2.3GB

558MB

173MB

Mosaic size

25.12 months$8.404 degree^2

24.25 months$2.032 degree^2

21.52 months$0.561 degree^2

Length of time to save

Costof generation

Page 35: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

SummaryWe started asking the question of how can a scientific workflow best make use of cloudsAssumed a simple cost model based on the Amazon fee structureConducted simulations

Need to find balance between cost and performanceComputational cost outweighs storage costs

Storing data on the Cloud is expensiveDid not explore issues of data security and privacy, reliability, availability, ease of use, etc

Page 36: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Will scientific applications move into clouds?

There is interest in the technology from applicationsThey often don’t understand what are the implicationsNeed tools to manage the cloud

Build and deploy imagesRequest the right number of resourcesManage costs for individual computationsManage project costs

Projects need to perform cost/benefit analysis

Page 37: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Issues Critical to Scientists

Reproducibility – yes—maybe--through virtual images, if we package the entire environment, the application and the VMsbehaveProvenance – still need tools to capture what happenedSharing – can be easier to share entire images and data

Data could be part of the image

Page 38: Clouds: An Opportunity for Scientific Applications? · zHelps with reproducibility for scientific applications zImages for a workflow application can contain: zApplication Codes zWorkflow

Ewa Deelman, [email protected] www.isi.edu/~deelman pegasus.isi.edu

Relevant LinksAmazon Cloud: http://aws.amazon.com/ec2/Pegasus-WMS: pegasus.isi.eduDAGMan: www.cs.wisc.edu/condor/dagman

Gil, Y., E. Deelman, et al. Examining the Challenges of Scientific Workflows. IEEE Computer, 2007.Workflows for e-Science, Taylor, I.J.; Deelman, E.; Gannon, D.B.; Shields, M. (Eds.), Dec. 2006

LIGO: www.ligo.caltech.edu/SCEC: www.scec.orgMontage: montage.ipac.caltech.edu/Condor: www.cs.wisc.edu/condor/