Top Banner
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Marc Trimuschat AWS Storage Services November 2016 AWS Data Transfer Services Data Ingest Strategies into the AWS Cloud ENT210
42

AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Apr 16, 2017

Download

Technology

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Marc Trimuschat

AWS Storage Services

November 2016

AWS Data Transfer ServicesData Ingest Strategies into the AWS Cloud

ENT210

Page 2: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Storage is the gravity

for cloud applications

Page 3: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Batches and Streams

Direct

Connect

Snowball,

Snowball Edge,

Snowmobile

3rd Party

Connectors

Transfer

Acceleration

Storage

GatewayKinesis Firehose

File

Amazon EFS

Block

Amazon EBS (persistent)

Object

Amazon GlacierAmazon S3 Amazon EC2

Instance Store (ephemeral)

Internet/VPN CloudFront

Page 4: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Internet/VPN ingest

Page 5: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

What is Internet/VPN?

Globally available

Default method of ingesting content into Amazon S3

Simple standards-based (HTTP) connection

Use your existing internet connection

Available in a VPC for VPN connectivity

Acceleration through multipart upload

Data transfer into AWS is free

VPN connections using VPC virtual private gateway•$0.05 per VPN connection-hour

•$0.048 per VPN connection-hour for connections to the Tokyo region

Page 6: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

How does Internet/VPN ingest work?

Accelerate data transfer using

multipart upload

Ingest data directly into S3 buckets

with existing internet connectivity

S3 bucket

AWS Region

and

through the console or API

customer

gateway

endpoints

VPN

connection

Internet Internet through VPN +

VPC

Page 7: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Amazon S3 Transfer Acceleration

Page 8: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

What is Transfer Acceleration?

Network- and protocol-based data transfer service

Acceleration of data ingress/egress with S3 buckets

Typically 50% to 300% faster

Feature of S3 enabled at the bucket level

Available in all S3 regions worldwide

No client/server software required

No code changes to your application

No firewall exceptions

Simple pricing model

Page 9: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Ingest & egress with Transfer Acceleration

S3 bucketAWS edge

location

Uploader

Optimized

throughput!

Uses AWS 59 global edge locations

AWS determines best edge location

Data transfer optimized between

edge and customer, and edge and S3

Data is not stored on the edge cache

Page 10: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Customers: Frame.io, Hudl, Viocorp

Problem Statement:• Needed to accelerate customer content ingest into their respective

applications running on AWS

• Existing ingest options were proprietary and too expensive

Use of AWS:• S3 and S3 transfer acceleration for massively scalable ingest

• S3 for storage, CloudFront and S3 transfer acceleration for ingest

Business Benefits: • Global highly distributed data transport available on demand

• Massive scalability and elasticity

• Lower TCO for storage and data transport infrastructure

Accelerating media content uploads to their platforms

S3 BucketAWS Edge

Location

Uploader

Optimized

Throughput!

Page 11: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Amazon

Route 53

Resolve

b1.s3-accelerate.amazonaws.com

HTTPS PUT/POST

upload_files.zip

HTTP/S PUT/POST

“upload_files.zip”

Service traffic flowClient to S3 bucket example

S3 bucket

b1.s3-accelerate.amazonaws.com

EC2 proxy

AWS region

AWS edge location

Customer client

1

2

3

4Data is not cached on the

AWS edge location

Fully managed file transfer acceleration

using all AWS edge locations

Page 12: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Rio DeJaneiro

Warsaw New York Atlanta Madrid Virginia Melbourne Paris LosAngeles

Seattle Tokyo Singapore

Tim

e [h

rs]

500 GB upload from these edge locations to a bucket in Singapore

Public internet

How fast is S3 Transfer Acceleration?

S3 transfer acceleration

Page 13: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

AWS Direct Connect

Page 14: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

What is AWS Direct Connect?

Dedicated, 1 or 10 GE private pipes into AWS

Create private (VPC) or public virtual interfaces to AWS

Reduced data-out rates (data-in still free)

Consistent network performance

At least 1 location to each AWS region

Option for redundant connections

Uses BGP to exchange routing information over a VLAN

Page 15: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Physical connection

• Cross-connect at the location

• Single-mode optical fiber

- 1000Base-LX or 10GBASE-LR

• Potential onward delivery through Direct Connect partner

• Customer router

Page 16: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

At the Direct Connect location

CORP

AWS Direct

Connect

Routers

Customer

Router

Colocation

DX Location

Customer

network`

AWS backbone

network

Cross-

connect

Customer

router

Customer’s network

Demarcation

Page 17: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Dedicated port through Direct Connect partner

CORP

AWS Direct

Connect

Routers

Colocation

DX Location

Partner network

AWS backbone

network

Cross-

connect

Customer

router

Partner

network

Access

circuit

Demarcation

Partner

equipment

Page 18: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Hybrid cloud storage expansion:

Amazon EFS through Direct Connect

“Bursting”

File WorkloadsData Migration

into EFS

Amazon EFSOn-Premises AWS Direct Connect

Page 19: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

AWS Storage Gateway

Page 20: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

What is AWS Storage Gateway?

Works with your existing applications

Secure and durable storage in AWS

Low latency for frequently used data

Scalable and cost-effective on-premises storage - $.01/GB written to AWS + S3/Amazon Glacier storage fees

Service connecting an on-premises software appliance

with cloud-based storage

Page 21: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Hybrid storage use cases and architectures for

AWS Storage Gateway

Enabling cloud workloadsMove data to AWS storage for Big Data, cloud bursting, or migration

Tiered cloud storageEasily add AWS storage to your on-premises environment

Backup, archive, and disaster recoveryCost effective storage in AWS with local or cloud restore

Page 22: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Storage Gateway hybrid storage solutionsEnables using standard storage protocols to access AWS storage services

Customer Premises

Storage

Gateway

Amazon EBS

snapshots

Amazon

S3

Amazon Glacier

AWS Identity and Access

Management (IAM)

AWS Key Management

Service (KMS)

AWS

CloudTrail

Amazon

CloudWatch

Enterprise

storage

Devices

Application

servers

Page 23: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Storage gateway – Files, volumes, and tapes

File gateway NFS (v3 and v4.1) interface **NEW!**

On-premises file storage backed by Amazon S3 objects

Volume gateway iSCSI block interface

On-premises block storage backed by Amazon S3 with EBS snapshots

Tape gateway iSCSI virtual tape library (VTL) interface

Virtual tape storage in Amazon S3 and Glacier with VTL management

Page 24: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Detail: AWS File Gateway for S3

NFS Interface Elasticity Amazon S3 Bucket

Easy Integration Cloud ScaleCloud Access

Page 25: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

AWS File Gateway

Bursting Tiered Storage

STG213

Page 26: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

AWS Snowball

Page 27: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

What is AWS Snowball?

Petabyte-scale data transport

E-ink shipping

label

Ruggedized case

“8.5G impact”

All data encrypted

end-to-end

Rain- and dust-

resistant

Tamper-resistant

case and

electronics

80 TB

10 GE network

Page 28: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

AWS storage migration expansion:

AWS Snowball

Transfer

CapacityIntegration

Regional

Availability

80TB model

HDFS support

3rd party API

HIPAA support

All EXCEPT:

Asia Pacific (Singapore)

Asia Pacific (Seoul)

China (Beijing)

Page 29: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

How it works

Page 30: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

How fast is Snowball?

• Less than 1 day to transfer 200TB via 3x10G connections with 3

Snowballs, less than 1 week including shipping

• Number of days to transfer 200TB via the Internet at typical utilizations

Internet Connection Speed

Utilization 1Gbps 500Mbps 300Mbps 150Mbps

25% 71 141 236 471

50% 36 71 118 236

75% 24 47 225 157

Page 31: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Customer: Scripps Networks Interactive

Problem Statement:• Need storage platform to manage active archive content

• Existing content repository too large to migrate via available

network-based ingest methods

Use of AWS:• S3 and Snowball for massively scalable ingest

• S3 for storage, Glacier for content archive

• Snowball to securely transport existing media content from on-

premises storage and tape vault

Business Benefits: • Petabyte-scale data transport without increased network costs

• Massive scalability and elasticity

• Lower TCO for active archive storage

Active archive transport and archival for digital content provider

Page 32: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

AWS storage migration expansion:

AWS Snowmobile

Page 33: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Hybrid cloud storage expansion:

AWS Snowball Edge

On-premises

CapacityOn-premises

Integration

On-premises

Compute

Clustered local storage

100TB capacity

NFS and S3-compatible

endpoint

AWS Lambda

support

Page 34: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Snowball Edge use cases

Offline

Staging

Local Tiering and

ComputeIoT

Local

Transformation

Page 35: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

AWS Snowball Edge

Integrated

Storage and Compute

Applications

STG214

Page 36: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Storage Ecosystem Partners

Page 37: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Hybrid cloud storage ecosystem

BackupAWS Storage Gateway VTL

Direct to Amazon S3

File

SystemsObject Storage

Block Storage

Page 38: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Backup to AWS approaches

Amazon S3

Amazon

GlacierAWS

Direct

Connect

Internet

Amazon S3-IA

Application

servers

Cloud gateway

Local disk

Media

server

Cloud gateway

Application

servers

Backup SW cloud connector

Local disk

Media

server with cloud

connector

Page 39: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Bursting

Migration

Tiering

Accessing EFS

via Direct ConnectStorage

Gateway

Snowball

Edge

3rd Party

Ecosystem

AWS hybrid cloud storage choices

Page 40: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Related Sessions

• Snowball Edge: STG214, ENT211

• Snowmobile: STG214, ENT211

• Storage Gateway: STG213, ENT211

Page 41: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Thank you!

Page 42: AWS re:Invent 2016: Deep Dive on AWS Cloud Data Migration Services (ENT210)

Remember to complete

your evaluations!