Cloud Research by Matt Wood, Technology Evangelist
8 mattwoodaws-intro-pdf-110411093115-phpapp01

Jun 26, 2015

Carl Chesal

Matt Wood of AWS
"Cloud Research"
Europe April 2011 @ the Eagle Genomics Symposium
Transcript
Page 1

Cloud Research

Matt Wood, Technology Evangelist

Page 2

Hello.

Page 3

Text

Page 4

Thank you.

Page 5

The Cloud by Example

Page 6

The Cloud by Example

Page 7

Infrastructure services

Page 8

?

Page 9

On demand

Page 10

Pay as you go

Page 11

Pay for what you use

Page 12

Elastic capacity

Page 13

[Chart: capacity against time. Label: estimated demand]

Page 14

[Chart: capacity against time. Labels: estimated demand, investment, infrastructure]

Page 15

[Chart: capacity against time. Labels: real demand, infrastructure]

Page 16

[Chart: capacity against time. Labels: real demand, elastic capacity]
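
The capacity-against-time slides contrast infrastructure sized for estimated demand with capacity that follows real demand. A minimal sketch of that arithmetic follows. The demand series and the unit price are made-up illustrative assumptions, not figures from the talk, and the function names are hypothetical.

```python
# Hypothetical hourly demand, in servers needed (illustrative numbers only).
demand = [2, 3, 8, 20, 7, 3, 2, 12]

# Assumed price per server-hour; not a real AWS price.
PRICE_PER_SERVER_HOUR = 0.10


def fixed_cost(demand, provisioned):
    """Cost when a fixed number of servers is paid for in every hour."""
    return provisioned * len(demand) * PRICE_PER_SERVER_HOUR


def unmet_demand(demand, provisioned):
    """Server-hours of demand that a fixed fleet could not serve."""
    return sum(max(0, d - provisioned) for d in demand)


def elastic_cost(demand):
    """Cost when capacity tracks demand hour by hour (pay for what you use)."""
    return sum(demand) * PRICE_PER_SERVER_HOUR


peak = max(demand)
print("fixed fleet sized for peak:", round(fixed_cost(demand, peak), 2))
print("fixed fleet of 8, unmet server-hours:", unmet_demand(demand, 8))
print("elastic capacity:", round(elastic_cost(demand), 2))
```

Sizing a fixed fleet for the peak pays for idle servers in every quiet hour, while sizing it below the peak leaves demand unserved. The elastic case pays only for the server-hours actually used.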

Page 17

Agility

Page 18

Faster to prototype

Page 19

Faster to production

Page 20

Undifferentiated heavy lifting

Page 21

Tools for accelerating research

Page 22

[Chart: vertical axis from 0 to 300; horizontal axis from Q4 2006 to Q4 2010]

Page 23

The Cloud by Example

Page 24
Page 25

Data management

Page 26

Biomarker Warehouse: pre-clinical, clinical, 3rd party data and publications

[Chart: cost comparison of In house, Managed Service and Cloud (AWS), split into Servers, Storage and Overhead; the three dollar totals are garbled beyond recovery in this transcript]

Estimated cost: 10 TB warehouse over 3 years

Page 27

Data processing

Page 28
Page 29

http://cyclecomputing.com

Page 32

http://www.rightscale.com

Page 33

[Diagram: Amazon Elastic MapReduce. Labels: Input Data; Input S3 bucket; Amazon S3; Deploy Application (Web Console, Command line tools); Elastic MapReduce running Hadoop on Amazon EC2 Instances; Input dataset; Output results; Output S3 bucket; Notify; End; Get Results]
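
The Elastic MapReduce labels describe a job that reads an input dataset from one S3 bucket, runs Hadoop on EC2 instances, and writes results to another S3 bucket. As a rough sketch, such a job could be described with a request dictionary shaped like the keyword arguments of boto3's EMR `run_job_flow` call. The bucket names, script names, release label, instance type, and role names below are assumptions for illustration, and the block only builds the dictionary without contacting AWS.

```python
def build_job_flow(input_uri, output_uri, mapper_uri, reducer_uri, instance_count=3):
    """Build a request dict for a Hadoop streaming job (illustrative values only)."""
    mapper_name = mapper_uri.rsplit("/", 1)[-1]
    reducer_name = reducer_uri.rsplit("/", 1)[-1]
    return {
        "Name": "example-streaming-job",          # hypothetical job name
        "ReleaseLabel": "emr-6.15.0",             # assumed release label
        "Instances": {
            "MasterInstanceType": "m5.xlarge",    # assumed instance type
            "SlaveInstanceType": "m5.xlarge",
            "InstanceCount": instance_count,
            # Shut the cluster down once the step finishes.
            "KeepJobFlowAliveWhenNoSteps": False,
        },
        "Steps": [
            {
                "Name": "streaming-step",
                "ActionOnFailure": "TERMINATE_CLUSTER",
                "HadoopJarStep": {
                    "Jar": "command-runner.jar",
                    "Args": [
                        "hadoop-streaming",
                        "-files", f"{mapper_uri},{reducer_uri}",
                        "-mapper", mapper_name,
                        "-reducer", reducer_name,
                        "-input", input_uri,      # input S3 bucket
                        "-output", output_uri,    # output S3 bucket
                    ],
                },
            }
        ],
        "JobFlowRole": "EMR_EC2_DefaultRole",     # assumed default role names
        "ServiceRole": "EMR_DefaultRole",
    }


job = build_job_flow(
    "s3://example-input-bucket/reads/",           # hypothetical buckets and scripts
    "s3://example-output-bucket/results/",
    "s3://example-code-bucket/mapper.py",
    "s3://example-code-bucket/reducer.py",
)
print(job["Steps"][0]["HadoopJarStep"]["Args"])
```

With credentials configured, a dictionary like this could be passed as `boto3.client("emr").run_job_flow(**job)`; that call is left out here so the sketch stays runnable offline.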

Page 34

Preprocessed reads

Map: Bowtie

Sort: Bin and partition

Reduce: SoapSNP

Crossbow: Rapid whole genome SNP analysis

Langmead B, Schatz MC, Lin J, Pop M, Salzberg SL. Genome Biol 2009, 10(11): R134.
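
The Crossbow slide names three stages: a map step that aligns reads, a sort step that bins and partitions alignments, and a reduce step that calls SNPs. The toy sketch below mirrors that shape in a single Python process. It is not Crossbow: the brute-force aligner stands in for Bowtie, the majority-vote caller stands in for SoapSNP, and the reference, reads, and partition size are invented for illustration.

```python
from collections import Counter, defaultdict

REFERENCE = "ACGTACGTTAGC"                          # toy reference (assumption)
READS = ["ACGTAC", "GTACGA", "CGATAG", "GATAGC"]    # toy reads with one variant
PARTITION_SIZE = 4                                  # toy genomic bin width


def align(read, reference, max_mismatches=1):
    """Stand-in for Bowtie: best ungapped placement by Hamming distance."""
    best = None
    for pos in range(len(reference) - len(read) + 1):
        window = reference[pos:pos + len(read)]
        mismatches = sum(a != b for a, b in zip(read, window))
        if mismatches <= max_mismatches and (best is None or mismatches < best[1]):
            best = (pos, mismatches)
    return None if best is None else best[0]


def map_step(reads, reference):
    """Map: align each read and emit (partition, position, base) per aligned base."""
    for read in reads:
        pos = align(read, reference)
        if pos is None:
            continue
        for offset, base in enumerate(read):
            position = pos + offset
            yield position // PARTITION_SIZE, position, base


def sort_step(records):
    """Sort: bin records by partition and order them by position within a bin."""
    bins = defaultdict(list)
    for partition, position, base in records:
        bins[partition].append((position, base))
    return {key: sorted(values) for key, values in sorted(bins.items())}


def reduce_step(bins, reference, min_depth=2):
    """Reduce: stand-in for SoapSNP, a majority vote per position in each bin."""
    snps = []
    for records in bins.values():
        pileup = defaultdict(Counter)
        for position, base in records:
            pileup[position][base] += 1
        for position in sorted(pileup):
            base, count = pileup[position].most_common(1)[0]
            if count >= min_depth and base != reference[position]:
                snps.append((position, reference[position], base))
    return snps


snps = reduce_step(sort_step(map_step(READS, REFERENCE)), REFERENCE)
print(snps)
```

Because every record carries its partition key, each bin can be reduced independently, which is the property that lets the real pipeline spread the work across many machines.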

Page 35

CloudBurst

Catalog k-mers

Collect seeds

End-to-end alignment

http://cloudburst-bio.sourceforge.net; Bioinformatics 2009 25: 1363-1369
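
The three CloudBurst stages listed on this slide (catalog k-mers, collect seeds, end-to-end alignment) follow a seed-and-extend pattern. The sketch below shows that pattern in plain, single-process Python. It is an illustration under assumptions, not CloudBurst's code: the function names, the toy reference, the seed length, and the mismatch limit are all invented here.

```python
from collections import defaultdict


def catalog_kmers(reference, k):
    """Catalog k-mers: index every k-mer of the reference by its start position."""
    index = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        index[reference[pos:pos + k]].append(pos)
    return index


def collect_seeds(read, index, k):
    """Collect seeds: k-mers shared with the reference imply candidate read starts."""
    starts = set()
    for offset in range(len(read) - k + 1):
        for ref_pos in index.get(read[offset:offset + k], []):
            starts.add(ref_pos - offset)
    return sorted(starts)


def end_to_end(read, reference, start, max_mismatches):
    """End-to-end alignment: compare the whole read at a candidate start."""
    if start < 0 or start + len(read) > len(reference):
        return None
    window = reference[start:start + len(read)]
    mismatches = sum(a != b for a, b in zip(read, window))
    return mismatches if mismatches <= max_mismatches else None


def align(read, reference, k=3, max_mismatches=1):
    """Return (start, mismatches) for every seed that extends to a full alignment."""
    index = catalog_kmers(reference, k)
    hits = []
    for start in collect_seeds(read, index, k):
        mismatches = end_to_end(read, reference, start, max_mismatches)
        if mismatches is not None:
            hits.append((start, mismatches))
    return hits


reference = "ACGTACGTTAGC"   # toy reference (assumption)
print(align("CGATAG", reference))
print(align("ACGTAC", reference))
```

Seeds keep the expensive full comparison limited to positions that already share an exact k-mer with the read, rather than every position in the reference.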

Page 36

ASSEMBLING GENOMES

140 million 454 reads

Image: Matt Wood

Page 37

Map 100 million, 100 base paired end reads

Quad core with 5 GB of RAM would take 16 days

30 high-memory instances; 32 hours; $195

BLAT @ U. PENN
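
The figures on this slide can be turned into a quick back-of-the-envelope comparison. The sketch below uses only the numbers quoted above; the derived per-instance-hour value is an implied average, not a published price.

```python
# Figures quoted on the slide.
instances = 30
wall_clock_hours = 32
total_cost_usd = 195
single_machine_days = 16

# Derived quantities.
instance_hours = instances * wall_clock_hours
implied_cost_per_instance_hour = total_cost_usd / instance_hours
single_machine_hours = single_machine_days * 24
speedup = single_machine_hours / wall_clock_hours

print("instance-hours used:", instance_hours)
print("implied cost per instance-hour (USD):", implied_cost_per_instance_hour)
print("wall-clock speedup over the quad core:", speedup)
```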

Page 38

HEAVY-ION COLLISIONS @ RHIC

Problem: Quark physics conference imminent but no compute resources handy

Solution: NIMBUS context broker allowed researchers to provision 300 nodes and get the simulations done

Page 39

Collaboration

Page 41
Page 42

http://www.cloudbiolinux.com/

Page 43

http://usegalaxy.org/cloud

Page 44
Page 45

Applications and platforms

Page 46

http://heroku.com

Page 47

http://chempedia.com/

Page 48

Security

Page 49

Shared responsibility

Page 50

Requirement based access

Page 51

Certification

Page 52

ISO 27001+

SAS 70 Type II

Page 53

PCI DSS Level 1

Page 54

Control objectives

Security organisation

Employee lifecycle

Logical security

Secure data handling

Physical security

Environmental safeguards

Change management

Incident handling

Data integrity

Availability and redundancy

Page 55

Data access control

Page 56

Identity and access

Page 57

Independent buildings

Separate flood zones

Geographically separated

Redundant power

Redundant connectivity

Highly monitored

Page 58

Default deny firewall

Page 59

Security groups
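
A default deny firewall lets traffic through only when an explicit rule allows it. The sketch below models that idea with a small rule list. It is a simplified illustration, not the AWS security group implementation, and the `AllowRule` type, the rules, and the addresses (taken from documentation-reserved ranges) are assumptions.

```python
import ipaddress
from dataclasses import dataclass


@dataclass(frozen=True)
class AllowRule:
    """One explicit allow rule: protocol, port range, and source network."""
    protocol: str
    from_port: int
    to_port: int
    source_cidr: str


def is_allowed(rules, protocol, port, source_ip):
    """Return True only if some rule explicitly allows the traffic."""
    address = ipaddress.ip_address(source_ip)
    for rule in rules:
        if (
            rule.protocol == protocol
            and rule.from_port <= port <= rule.to_port
            and address in ipaddress.ip_network(rule.source_cidr)
        ):
            return True
    return False  # default deny: nothing matched


# Hypothetical group: SSH from one lab network, HTTPS from anywhere.
rules = [
    AllowRule("tcp", 22, 22, "203.0.113.0/24"),
    AllowRule("tcp", 443, 443, "0.0.0.0/0"),
]

print(is_allowed(rules, "tcp", 22, "203.0.113.7"))
print(is_allowed(rules, "tcp", 22, "198.51.100.1"))
print(is_allowed(rules, "tcp", 3306, "203.0.113.7"))
```

An empty rule list denies everything, which is the starting point the default deny slide describes.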

Page 60

DDOS

Man in the Middle

IP spoofing

Page 61

Resource isolation

Virtual Private Cloud

Page 62

[Diagram labels: Customer's network; Router; Secure VPN connection over the internet; VPN Gateway; Amazon Web Services infrastructure; Customer's isolated AWS resources (Subnet 1, Subnet 2, Subnet 3, Subnet 4)]
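
The diagram shows isolated AWS resources split across four subnets and linked to the customer's network over a VPN. One way to reason about such a layout is to carve a private address block into four equal subnets and check that it does not collide with the on-premises range. Both CIDR blocks below are hypothetical; the slide gives no addresses.

```python
import ipaddress

VPC_CIDR = ipaddress.ip_network("10.0.0.0/16")          # hypothetical cloud-side block
CUSTOMER_CIDR = ipaddress.ip_network("192.168.0.0/16")  # hypothetical on-premises block

# Two extra prefix bits split the block into four equal subnets.
subnets = list(VPC_CIDR.subnets(new_prefix=18))
for number, subnet in enumerate(subnets, start=1):
    print(f"Subnet {number}: {subnet} ({subnet.num_addresses} addresses)")

# Overlapping ranges on the two sides of a VPN would make routing ambiguous.
overlaps = VPC_CIDR.overlaps(CUSTOMER_CIDR)
print("overlaps customer network:", overlaps)
```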

Page 63

Dedicated instances

Virtual Private Cloud

Page 64

aws.amazon.com/security

Page 65

Data stays local

Page 66

aws.amazon.com

Page 67

Thank you!

Page 68

[email protected]

Questions + comments

@mza on Twitter