Amazon Web Services for the SQL Server Professional Lynn Langit Architect Level: Intermediate

AWS for the Data Professional

May 14, 2015


Lynn Langit

Core AWS services for the data professional - EC2, RDS, S3, Kinesis and more
Transcript
Page 1: AWS for the Data Professional

Amazon Web Services for the SQL Server Professional

Lynn Langit, Architect

Level: Intermediate

Page 2: AWS for the Data Professional

What and Why AWS?

AWS: Amazon’s cloud

• Set of services
– Compute
– Data
– More

• Market leader
– In market longest
– Usually cheapest
– Most often used in production

Page 3: AWS for the Data Professional

Amazon Web Services

Page 4: AWS for the Data Professional

EC2 – VMs for train, test & production

Pricing
• On-demand
• Spot
• Reserved
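To make the on-demand vs. reserved trade-off concrete, here is a minimal sketch of the break-even arithmetic. All rates and fees below are hypothetical placeholders, not actual AWS prices, and the function names are made up for illustration. Spot pricing is left out because the spot price fluctuates with demand.

```python
# Sketch: when does a reserved instance beat on-demand?
# All numbers are HYPOTHETICAL, chosen only to make the arithmetic easy to follow.

def on_demand_cost(hours: float, hourly_rate: float) -> float:
    """Pay only for the hours used, at the full hourly rate."""
    return hours * hourly_rate


def reserved_cost(hours: float, upfront_fee: float, reserved_hourly_rate: float) -> float:
    """Pay an upfront fee, then a discounted hourly rate."""
    return upfront_fee + hours * reserved_hourly_rate


def break_even_hours(on_demand_rate: float, upfront_fee: float,
                     reserved_hourly_rate: float) -> float:
    """Hours of use at which reserved and on-demand cost the same."""
    saving_per_hour = on_demand_rate - reserved_hourly_rate
    if saving_per_hour <= 0:
        raise ValueError("reserved rate must be lower than the on-demand rate")
    return upfront_fee / saving_per_hour


if __name__ == "__main__":
    # Hypothetical: $0.25/h on-demand vs. $500 upfront + $0.125/h reserved.
    hours = break_even_hours(0.25, 500.0, 0.125)
    print(f"Reserved pays off after {hours:.0f} hours of use")
```

Below the break-even point (train and test workloads that run a few hours a week) on-demand is usually the better fit; above it (always-on production) reserved tends to win.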

Page 5: AWS for the Data Professional

Demo – EC2
• Virtual Machines

Page 6: AWS for the Data Professional

S3 and Glacier

Page 7: AWS for the Data Professional

About EC2 storage

S3
• 10 GB max
• 3 copies
• Usually for data storage

EBS – expand / snapshot, etc…
• Can store AMIs (persistent)
• Can ‘stop’ EC2 instances and ‘re-start’ – saves $$$
• Costs more
• Can expand
• One copy only (faster)

SSD – optional
• For high performance
• Provisioned IOPS

Page 8: AWS for the Data Professional

Demo – S3
• File Storage

Page 9: AWS for the Data Professional

Demo – Glacier
• Archival Storage

Page 10: AWS for the Data Professional

RDS – Managed Relational Data

Page 11: AWS for the Data Professional

Demo – RDS
• SQL Server as a service

Page 12: AWS for the Data Professional

RDS vs. EC2 for SQL Server

• Provisioned IO – performance guarantees
• Scheduled backups
• Point in time restores
• Scheduled maintenance windows
• Full use of all SQL tools, SSMS, Profiler, DTA, etc…
• Supports Availability Groups (requires 2012 Enterprise)

Why RDS costs more

Page 13: AWS for the Data Professional

Redshift – $999 / TB / year
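As a quick worked example of what the headline figure means for a budget, here is a small sketch that scales the slide's $999 / TB / year number to a given warehouse size. The function names are made up for illustration, and the figure is only the slide's headline price; actual Redshift pricing depends on node type, region, and commitment.

```python
# Sketch: scale the slide's headline Redshift price to a warehouse size.
# 999 USD per TB per year comes from the slide; everything else is illustrative.

REDSHIFT_USD_PER_TB_YEAR = 999


def redshift_yearly_cost(terabytes: float) -> float:
    """Yearly cost in USD at the headline rate."""
    return terabytes * REDSHIFT_USD_PER_TB_YEAR


def redshift_monthly_cost(terabytes: float) -> float:
    """Yearly cost spread evenly over 12 months."""
    return redshift_yearly_cost(terabytes) / 12


if __name__ == "__main__":
    size_tb = 10  # hypothetical warehouse size
    print(f"{size_tb} TB: ${redshift_yearly_cost(size_tb):,.2f} per year, "
          f"${redshift_monthly_cost(size_tb):,.2f} per month")
```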

Page 14: AWS for the Data Professional

Demo – Redshift
• Data Warehousing as a Service

Page 15: AWS for the Data Professional

DynamoDB for fast NoSQL with SSDs

Page 16: AWS for the Data Professional

Demo – DynamoDB
• NoSQL on SSD

Page 17: AWS for the Data Professional

Elastic MapReduce for easy Hadoop

Page 18: AWS for the Data Professional

Demo – MapReduce
• Hadoop on AWS

Page 19: AWS for the Data Professional

Kinesis for real-time Big Data Streams

Page 20: AWS for the Data Professional

Demo – Kinesis
• Real-time streaming for Big Data

Page 21: AWS for the Data Professional

Data Pipelines – automated data transfer

Page 22: AWS for the Data Professional

Demo – Data Pipeline
• Build data flows on AWS

Page 23: AWS for the Data Professional

Integration w/ Visual Studio – AWS SDK

See Also:
• AWS Tools for Windows Developers
• Includes AWS PowerShell

Page 24: AWS for the Data Professional

AWS SDK includes AWS PowerShell

Page 25: AWS for the Data Professional

Demo – AWS SDK
• Add-in for Visual Studio and .NET

Page 26: AWS for the Data Professional

Cloud Database Services by Vendor

Category                     | AWS                                                 | Google                             | Microsoft
RDBMS VMs                    | EC2 AMIs w/SQL Server, etc…                         | GCE w/MySQL                        | Azure VM images w/SQL Server
Managed RDBMS                | RDS - SQL Server, MySQL                             | Cloud SQL - MySQL                  | SQL Azure
NoSQL buckets/databases      | S3, EBS, Glacier, DynamoDB                          | Cloud Storage, HR Datastore on GAE | Azure Blobs & Tables
Pipelines                    | Data Pipelines                                      | Data Pipelines (beta)              | SSIS?
Streaming / Machine Learning | Kinesis or Custom EC2                               | BigQuery & Prediction API          | StreamInsight, Azure Machine Learning
Document                     | MongoDB on EC2                                      | MongoDB on GCE                     | MongoDB on Windows Azure
Hadoop                       | MapReduce                                           | Big Query (Dremel)                 | HDInsight
Other                        | Redshift – Data Warehouse, Workspaces & Zocalo      | Managed VMs, GAE                   | Azure Marketplace – premium data

Page 27: AWS for the Data Professional

Costs - Free Tier for Database Services

Page 28: AWS for the Data Professional

How much does it cost?

Tip: When testing use Billing Alerts to make sure you’ve turned off test services!
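One way to set up such an alert from code is a CloudWatch alarm on the `EstimatedCharges` billing metric. The sketch below assumes the boto3 SDK is installed, that billing alerts are enabled in the account's billing preferences, and that an SNS topic already exists; the topic ARN, threshold, and helper names are hypothetical. Treat it as a starting point rather than a finished setup.

```python
# Sketch: CloudWatch alarm on estimated AWS charges.
# Assumptions: boto3 is installed, billing alerts are enabled for the account,
# and the SNS topic ARN (hypothetical below) points at a topic you own.

def billing_alarm_params(threshold_usd: float, sns_topic_arn: str) -> dict:
    """Build the keyword arguments for CloudWatch put_metric_alarm."""
    return {
        "AlarmName": f"estimated-charges-over-{threshold_usd:g}-usd",
        "Namespace": "AWS/Billing",
        "MetricName": "EstimatedCharges",
        "Dimensions": [{"Name": "Currency", "Value": "USD"}],
        "Statistic": "Maximum",
        "Period": 21600,  # 6 hours; billing data is only published a few times a day
        "EvaluationPeriods": 1,
        "Threshold": float(threshold_usd),
        "ComparisonOperator": "GreaterThanThreshold",
        "AlarmActions": [sns_topic_arn],
    }


def create_billing_alarm(threshold_usd: float, sns_topic_arn: str) -> None:
    """Create the alarm. Needs AWS credentials; not called automatically."""
    import boto3  # imported here so the parameter builder works without boto3

    # Billing metrics are published in the us-east-1 region.
    cloudwatch = boto3.client("cloudwatch", region_name="us-east-1")
    cloudwatch.put_metric_alarm(**billing_alarm_params(threshold_usd, sns_topic_arn))


if __name__ == "__main__":
    # Hypothetical topic ARN; replace with your own before calling create_billing_alarm.
    arn = "arn:aws:sns:us-east-1:123456789012:billing-alerts"
    print(billing_alarm_params(10, arn))
```

A low threshold (a few dollars) is a reasonable choice for a test account, since the point is to notice a forgotten service quickly.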

Page 29: AWS for the Data Professional

Creative Financing

Regular Pricing
• Use what you need and no more, i.e. instance size, storage size…
• Watch for price drops – RDS price decrease this week

Smart EC2 Instance Usage
• Stop EC2 instances to reduce compute charges
• Terminate EC2 instances to reduce storage charges

Vanity Pricing
• Set pricing alerts
• Use spot pricing
• Re-sell compute / storage
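The EC2 usage advice on this slide comes down to simple arithmetic: a stopped EBS-backed instance no longer accrues compute charges but its volume is still billed, while a terminated instance whose volume is deleted accrues neither. A minimal sketch, with hypothetical rates and a made-up function name:

```python
# Sketch: monthly cost of one EBS-backed EC2 instance under different usage patterns.
# Rates are HYPOTHETICAL placeholders, not actual AWS prices.

HOURS_PER_MONTH = 730  # average number of hours in a month


def monthly_instance_cost(running_hours: float, hourly_rate: float,
                          ebs_gb: float, ebs_gb_month_rate: float) -> float:
    """Compute is billed for running hours only; EBS storage is billed all month."""
    compute = running_hours * hourly_rate
    storage = ebs_gb * ebs_gb_month_rate
    return compute + storage


if __name__ == "__main__":
    rate, volume_gb, gb_rate = 0.25, 100, 0.125  # hypothetical

    always_on = monthly_instance_cost(HOURS_PER_MONTH, rate, volume_gb, gb_rate)
    office_hours = monthly_instance_cost(160, rate, volume_gb, gb_rate)
    stopped_all_month = monthly_instance_cost(0, rate, volume_gb, gb_rate)

    print(f"Always on:         ${always_on:.2f}")
    print(f"Stopped off-hours: ${office_hours:.2f}")
    print(f"Stopped all month: ${stopped_all_month:.2f} (storage only)")
    print("Terminated, volume deleted: $0.00")
```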

Page 30: AWS for the Data Professional

Usage Summary

Compute
• EC2 – Dev & Test, Train, Prod

Storage
• S3 – Raw Storage
• Glacier – Archiving

Data Services
• RDS – Partially Managed RDBMS, HA SQL Server
• Redshift – Data Warehousing
• DynamoDB – fast NoSQL on SSDs
• EMR – On Demand MapReduce
• Kinesis – Streaming
• Data Pipelines – Automation

Page 31: AWS for the Data Professional


Page 32: AWS for the Data Professional

Keep Learning

• Connect
– @LynnLangit
– www.youtube.com/user/SoCalDevGal

• Get started
– Sign up for AWS – use ‘Free Tier’
– Email me to get $100 AWS usage credit