Interplanetary Cloud Computing: How the Cloud Is Used by the Scientific and Astronomy Community
Simulation by Mark Thompson of the University of Southern California to see which of 205,000 organic compounds could be used for photovoltaic cells for solar panel material.
1.21 petaFLOPS (Rpeak)
$68M => $33,000
Estimated computation time of 264 years, completed in 18 hours
“… A 156,314-core …, totaling 1.21 petaFLOPS…, to simulate 205,000 materials, crunched 264 compute years in only 18 hours”
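The figures quoted above imply a large wall-clock speedup and cost ratio. A minimal back-of-the-envelope sketch, using only the numbers on the slide and assuming a 365.25-day year (an assumption, not stated in the source):

```python
# Back-of-the-envelope check of the figures quoted on the slide.
# Assumption (not in the source): one year = 365.25 days.
HOURS_PER_YEAR = 365.25 * 24

compute_years = 264        # estimated computation time
wall_clock_hours = 18      # time the run actually took
cores = 156_314            # cores in the cluster
cost_before = 68_000_000   # USD, first figure on the slide
cost_after = 33_000        # USD, second figure on the slide

compute_hours = compute_years * HOURS_PER_YEAR
speedup = compute_hours / wall_clock_hours
core_hours = cores * wall_clock_hours
cost_ratio = cost_before / cost_after

print(f"Wall-clock speedup: about {speedup:,.0f}x")
print(f"Core-hours consumed: {core_hours:,}")
print(f"Cost ratio: about {cost_ratio:,.0f}x")
```

The core-hour total (cores times wall-clock hours) is of the same order as the 264 compute years, which is what one would expect from a workload that parallelizes well.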
Amazon EC2
Resizable compute capacity
Complete control of your computing resources
Reduces the time required to obtain and boot new server instances to minutes
Flexible Pricing Options
On-Demand Instances
• Pay as you go for computing power
• Pay only for what you use, no up-front commitments or long-term contracts
Reserved Instances
• Pay an up-front fee and receive a significant discount on the hourly pricing for that instance
• Also helps ensure that compute capacity is available when needed
• 1- or 3-year terms
Spot Instances
• Bid on available EC2 capacity
• If the current Spot Price is below your bid, your instances will start
• If there is a capacity constraint, your instances may be evicted
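The Spot rule described above can be sketched as a small simulation. This is only an illustration of the bid-versus-price rule as the slide states it, not the EC2 API; the `SpotRequest` class, the `is_running` helper, and every price below are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class SpotRequest:
    bid: float  # maximum hourly price the user is willing to pay (USD)

def is_running(request, spot_price, capacity_available=True):
    """Instances run while the spot price is below the bid and capacity exists."""
    return capacity_available and spot_price < request.bid

# Hypothetical hourly spot prices, one per hour.
prices = [0.03, 0.04, 0.07, 0.05, 0.02]
request = SpotRequest(bid=0.05)

timeline = [is_running(request, price) for price in prices]
hours_run = sum(timeline)

print(timeline)   # which hours the instances would be running
print(hours_run)  # number of hours actually run
```

In this sketch the instances stop in the hours where the price reaches or exceeds the bid and resume once it drops again, which is why Spot capacity suits interruptible batch work.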
“In many cases, cloud computing can be more secure than your internal infrastructure”
Tom Soderstrom, JPL CTO @ Re:Invent 2013 Hackathon
Storage and Big Data
Volume
“2.5 quintillion (10^18) bytes / day”
“[D,D-1] > [0,2003]”
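To put the daily volume quoted above into more familiar units, here is a short conversion sketch. It assumes decimal (SI) units and a 365-day year; both are assumptions, since the slide does not specify them.

```python
# Unit conversions for the "2.5 quintillion bytes / day" figure.
# Assumptions (not in the source): SI prefixes (1 TB = 1e12 bytes), 365-day year.
BYTES_PER_DAY = 2.5e18
SECONDS_PER_DAY = 24 * 60 * 60

exabytes_per_day = BYTES_PER_DAY / 1e18
terabytes_per_second = BYTES_PER_DAY / SECONDS_PER_DAY / 1e12
zettabytes_per_year = BYTES_PER_DAY * 365 / 1e21

print(f"{exabytes_per_day:.1f} EB per day")
print(f"{terabytes_per_second:.1f} TB per second")
print(f"{zettabytes_per_year:.2f} ZB per year")
```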
Velocity
“... 0.5 seconds on the search page can reduce traffic by 20%”
“... 100 ms of latency can cost 1% in sales”
“... a broker can lose $4 million per ms if their platform is 5 ms behind”
Variety
“~49% of data is in unstructured or semi-structured formats”
More data vs. better algorithms
Storage Options
• Simple Storage Service (S3) and Glacier
  • Designed for high durability: 99.999999999%
• Elastic Block Store (EBS)
  • Between 0.1% and 0.5% AFR per volume
• Local Instance Storage
  • Up to 48 terabytes per instance (spinning disks)
  • Up to 5.7 terabytes of SSD storage
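The durability and AFR figures above are easier to compare once turned into expected losses. A minimal sketch, assuming the durability figure is an annual per-object probability and that failures are independent; the object and volume counts are hypothetical.

```python
# Expected annual losses implied by the figures on the slide.
# Assumptions: durability is per object per year; failures are independent.
S3_ANNUAL_LOSS_PROBABILITY = 1e-11          # 1 - 0.99999999999 (eleven nines)
EBS_AFR_LOW, EBS_AFR_HIGH = 0.001, 0.005    # 0.1% to 0.5% annual failure rate

objects_stored = 10_000   # hypothetical number of S3 objects
volumes_in_use = 1_000    # hypothetical number of EBS volumes

expected_object_losses_per_year = objects_stored * S3_ANNUAL_LOSS_PROBABILITY
years_per_object_loss = 1 / expected_object_losses_per_year

expected_volume_failures_low = volumes_in_use * EBS_AFR_LOW
expected_volume_failures_high = volumes_in_use * EBS_AFR_HIGH

print(f"S3: about one object lost every {years_per_object_loss:,.0f} years")
print(f"EBS: about {expected_volume_failures_low:.0f} to "
      f"{expected_volume_failures_high:.0f} volume failures per year")
```

The contrast is the point: block volumes are expected to fail occasionally and should be snapshotted, while object storage is designed so that loss is negligible at this scale.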
Amazon S3
Storage for the Internet. Natively online, HTTP access
Store and retrieve any amount of data, any time, from anywhere on the web
Highly scalable, reliable, fast and durable
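Storing and retrieving an object might look like the sketch below. It uses the `put_object` and `get_object` calls of the boto3 S3 client; boto3 is a third-party package and is not mentioned in the slides, so its use here is an assumption. The bucket name, key, and the `InMemoryS3` stand-in are hypothetical and exist only so the sketch runs without AWS credentials.

```python
import io

def make_client():
    """Create a real S3 client (needs the boto3 package and AWS credentials)."""
    import boto3  # imported lazily so the rest of the sketch runs without AWS
    return boto3.client("s3")

def store_text(client, bucket, key, text):
    """Store a UTF-8 string as an object."""
    client.put_object(Bucket=bucket, Key=key, Body=text.encode("utf-8"))

def retrieve_text(client, bucket, key):
    """Retrieve an object and decode it as UTF-8."""
    response = client.get_object(Bucket=bucket, Key=key)
    return response["Body"].read().decode("utf-8")

class InMemoryS3:
    """Hypothetical offline stand-in that mimics only the two calls used above."""
    def __init__(self):
        self._objects = {}

    def put_object(self, Bucket, Key, Body):
        self._objects[(Bucket, Key)] = Body

    def get_object(self, Bucket, Key):
        return {"Body": io.BytesIO(self._objects[(Bucket, Key)])}

client = InMemoryS3()  # swap in make_client() to talk to real S3
store_text(client, "example-bucket", "results/run-001.txt", "18 hours")
print(retrieve_text(client, "example-bucket", "results/run-001.txt"))
```

Passing the client in as a parameter keeps the helpers independent of how credentials and regions are configured.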
Amazon Glacier
• Meet your compliance requirements
• Long-term archival and near-line DR
• Eleven nines of durability, the same as S3 Standard
• All data encrypted using server-side encryption
• Starting at $0.01/GB/month
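The "starting at" price above makes archival cost easy to estimate. A rough sketch, assuming a flat $0.01/GB/month with no retrieval, request, or early-deletion fees (those are ignored here); the monthly ingest figure is hypothetical.

```python
# Rough archival cost estimate at the price quoted on the slide.
# Assumption: flat storage price only; all other fees are ignored.
PRICE_PER_GB_MONTH = 0.01  # USD

def monthly_cost(stored_gb):
    """Storage bill for one month at the flat price."""
    return stored_gb * PRICE_PER_GB_MONTH

def cumulative_cost(gb_added_per_month, months):
    """Total spend when a fixed amount is added each month and never deleted."""
    total = 0.0
    stored = 0.0
    for _ in range(months):
        stored += gb_added_per_month
        total += monthly_cost(stored)
    return total

# Hypothetical archive growing by 30,000 GB (roughly 1 TB per day) each month.
first_year = cumulative_cost(30_000, 12)
print(f"Month 1 bill: ${monthly_cost(30_000):,.2f}")
print(f"First-year total: ${first_year:,.2f}")
```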
“Every day our genome sequencers produce terabytes of data. As our company moves into the clinical space, we face a legal requirement to archive patient data for years that would drastically raise the cost of storage. Thanks to Amazon Glacier’s secure and scalable solution, we will be able to provide cost-effective, long-term storage and thereby eliminate a barrier to providing whole genome sequencing for medical treatment of cancer and other genetic diseases.”
- Keith Raffel, Senior Vice President and Chief Commercial Officer, Complete Genomics
Long-Term Data Archival
Demo!
Distributed Network Filesystem
HDFS, PVFS, Lustre, GlusterFS, OrangeFS, NFS, …
Sample AWS Architecture
Processing large amounts of parallel data using a scalable cluster
Use commonly available tools, including Grid Engine, Condor, StarCluster, Mesos, YARN, …
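The underlying pattern, splitting a data set into independent chunks and processing them in parallel, can be sketched locally. This does not use Grid Engine, Condor, or any of the schedulers named above; a thread pool stands in for the cluster, and `process_chunk` is a hypothetical placeholder workload.

```python
from concurrent.futures import ThreadPoolExecutor

def split_into_tasks(items, n_tasks):
    """Split items into n_tasks contiguous chunks of near-equal size."""
    base, extra = divmod(len(items), n_tasks)
    chunks, start = [], 0
    for i in range(n_tasks):
        size = base + (1 if i < extra else 0)  # first `extra` chunks get one more
        chunks.append(items[start:start + size])
        start += size
    return chunks

def process_chunk(chunk):
    """Placeholder workload: sum of squares of the chunk."""
    return sum(x * x for x in chunk)

def run_parallel(items, n_workers):
    """Process each chunk on its own worker and combine the partial results."""
    chunks = split_into_tasks(items, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(process_chunk, chunks))
    return sum(partials)

result = run_parallel(list(range(10)), 3)
print(result)
```

On a real cluster each chunk would typically become one scheduler task reading its slice from shared storage, so the number of workers can grow or shrink with the size of the job.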