Microsoft さとうなおき (@satonaoki)
[Azure Big Data-Related Services and Hortonworks Study Session] Azure HDInsight

Jan 23, 2018

Data & Analytics

Transcript
Page 1: [Azure Big Data-Related Services and Hortonworks Study Session] Azure HDInsight

Microsoft

さとうなおき (@satonaoki)

Page 2:

Why Deploy To the Cloud?

Microsoft’s Solution

How Do I Get Started?

Page 3:

Breaking points of traditional approach

Page 4:

Breaking points of traditional approach

Page 5:

Breaking points of traditional approach

Page 6:

Breaking points of traditional approach

Page 7:

What if you could handle big data?

Page 8:

Introducing Apache Hadoop

Page 9:

Data volume

Page 10:

Data variety

Page 11:

Data velocity

Page 12:

Hadoop is a platform with portfolio of projects

Page 13:

A Hadoop distribution is a package of projects

Page 14:

With many contributors

Page 15:

Business applications of Hadoop

Page 16:

New analytic applications from new data

Page 17:

What Is Hadoop?

Microsoft’s Solution

How Do I Get Started?

Page 18:

Challenges with implementing Hadoop

Page 19:

Why Cloud + Big Data?

Speed, Scale, Economics

• Always Up, Always On

• Open and flexible

• Time to value

• Data of all Volume, Variety, Velocity

• Massive Compute and Storage

• Deployment expertise

Page 20:

Why Hadoop in the Cloud?

Page 21:

Scenarios For Deploying Hadoop As Hybrid

Page 22:

What Is Hadoop?

Why Deploy To the Cloud?

Microsoft’s Solution

Page 23:

Introducing Azure HDInsight

Page 24:

Microsoft contributions to Hadoop

Page 25:

Microsoft + Hortonworks

Page 26:

HDInsight Built for Windows or Linux

Page 27:

HDInsight Supports Hive

Hadoop 2.0

Page 28:

HDInsight Supports HBase

[Diagram labels: Name Node; Job Tracker; HMaster; Coordination; Data Node (x4); Task Tracker (x4); Region Server (x4)]
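The HBase components named on this slide split a table by row-key range: each Region Server serves contiguous ranges of keys, and the HMaster tracks the assignment. As a rough illustration only (plain Python, not the HBase client API), the sketch below routes a row key to a server. The split keys, the server names, and the one-region-per-server layout are all hypothetical simplifications.

```python
from bisect import bisect_right

# Hypothetical split points. Region i covers row keys in
# [SPLIT_KEYS[i-1], SPLIT_KEYS[i]); the first region has no lower bound
# and the last region has no upper bound.
SPLIT_KEYS = ["g", "n", "t"]

# Hypothetical server names, one region per server for simplicity.
# A real cluster usually hosts many regions on each Region Server.
REGION_SERVERS = [
    "region-server-1",
    "region-server-2",
    "region-server-3",
    "region-server-4",
]


def locate_region_server(row_key: str) -> str:
    """Return the server whose key range contains row_key."""
    # bisect_right counts how many split keys are <= row_key,
    # which is exactly the index of the region holding that key.
    return REGION_SERVERS[bisect_right(SPLIT_KEYS, row_key)]


if __name__ == "__main__":
    for key in ["apple", "moon", "zebra"]:
        print(key, "->", locate_region_server(key))
```

Because keys are range-partitioned rather than hashed, rows with nearby keys land on the same server, which is what makes range scans cheap.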

Page 29:

HDInsight Supports Mahout

Page 30:

HDInsight Supports Storm

[Diagram labels: Stream processing; Search and query; Data analytics (Excel); Web/thick client dashboards; Devices to take action; RabbitMQ / ActiveMQ]
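A Storm topology wires spouts (stream sources) to bolts (processing steps). The following is a minimal plain-Python sketch of that shape using generators. It does not use the Storm API, and the function names and sample sentences are invented for illustration.

```python
from collections import Counter
from typing import Dict, Iterable, Iterator, Tuple


def sentence_spout() -> Iterator[str]:
    """Spout: emits a stream of items (here, a fixed list of sample sentences)."""
    for sentence in ["the cow jumped", "the moon rose", "the cow slept"]:
        yield sentence


def split_bolt(sentences: Iterable[str]) -> Iterator[str]:
    """Bolt: splits each incoming sentence into words."""
    for sentence in sentences:
        for word in sentence.split():
            yield word


def count_bolt(words: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Bolt: keeps a running count and emits (word, count) after every word."""
    counts: Counter = Counter()
    for word in words:
        counts[word] += 1
        yield word, counts[word]


def run_topology() -> Dict[str, int]:
    """Wire spout -> split -> count and return the latest count per word."""
    latest: Dict[str, int] = {}
    for word, count in count_bolt(split_bolt(sentence_spout())):
        latest[word] = count
    return latest


if __name__ == "__main__":
    print(run_topology())
```

Each stage consumes items one at a time as they arrive, which is the streaming behavior the slide contrasts with batch processing. A real topology would run these stages in parallel across the cluster.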

Page 31:

Spark for Azure HDInsight: In-Memory Processing on Multiple Workloads

[Diagram labels: Azure HDInsight; In Memory; Spark]

• Single execution model for multiple tasks

• Up to 100x faster processing

• Developer friendly (Java, Python, Scala)

• BI tool of choice (Power BI, Tableau, Qlik, SAP)

• Notebook experience (Jupyter/IPython, Zeppelin)
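The in-memory idea behind these bullets can be sketched without Spark: parse a dataset once, keep the result in memory, and run several workloads against it instead of re-reading the input for each job. The example below is plain Python, not the Spark API, and the sample records and variable names are hypothetical.

```python
# Hypothetical sample records: date, city, event count.
RAW_LINES = [
    "2018-01-23,tokyo,3",
    "2018-01-23,osaka,5",
    "2018-01-24,tokyo,7",
]

parse_calls = 0  # counts how often the (expensive) parsing step runs


def parse(lines):
    """Parse CSV-like lines into (day, city, count) tuples."""
    global parse_calls
    parse_calls += 1
    return [(day, city, int(n)) for day, city, n in (line.split(",") for line in lines)]


# Parse once and keep the result in memory (the "cached" dataset).
cached = parse(RAW_LINES)

# Workload 1: total events per city, computed from the cached data.
total_by_city = {}
for _, city, n in cached:
    total_by_city[city] = total_by_city.get(city, 0) + n

# Workload 2: largest single count per day, reusing the same cached data.
max_per_day = {}
for day, _, n in cached:
    max_per_day[day] = max(max_per_day.get(day, n), n)

if __name__ == "__main__":
    print(total_by_city)
    print(max_per_day)
    print("parse calls:", parse_calls)
```

Both workloads read the same in-memory list, so the parsing cost is paid once. Spark applies the same principle across a cluster.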

Page 32:

R Server for HDInsight

• Familiarity of R (most popular language for data scientists)

• Scalability of Hadoop and Spark

• Up to 7x faster using Spark engine

• Train and run ML models on datasets of any size

• Cloud managed solution (easy setup, elastic, SLA)

Page 33:

HDInsight Allows You To Add Hadoop Projects

Page 34:

Microsoft Makes Hadoop Easier

Deep Visual Studio Integration

• Debug Hive jobs through YARN logs or troubleshoot Storm topologies

• Visualize Hadoop clusters, tables, and storage

• Submit Hive queries, Storm topologies (C# or Java spouts/bolts)

• IntelliSense

Page 35:

Introducing Azure HDInsight

Page 36:

Big Opportunities in the Cloud

• 3X: Spending on cloud-based Big Data and analytics solutions will grow 3 times faster than on-premises solutions [1]

• 52%: 52% of surveyed organizations plan to use or continue to deploy Hadoop in the cloud (IaaS and PaaS) [2]

• 128%: Microsoft is the only company with cloud revenue at large scale that grew triple digits in its fifth consecutive quarter

• 18: Microsoft Azure running Hadoop in more datacenters around the world than anyone else

Sources:

[1] http://www.idc.com/getdoc.jsp?containerId=prUS25329114

[2] Gartner Market Guide For Hadoop. December, 2015

Page 37:

Operational:

• Central US: Iowa

• West US: California

• East US: Virginia

• North Central US: Illinois

• US Gov: Iowa

• South Central US: Texas

• Brazil South: Sao Paulo State

• West Europe: Netherlands

• Sovereign Cloud: China North *: Beijing

• Sovereign Cloud: China South *: Shanghai

• Japan East: Tokyo, Saitama

• Japan West: Osaka

• India South: Chennai

• East Asia: Hong Kong

• SE Asia: Singapore

• Australia South East: Victoria

• Australia East: New South Wales

• India Central: Pune

• India West: Mumbai

• North Europe: Ireland

• East US 2: Virginia

Hadoop is being run everywhere

More Datacenters than any other vendor

Page 38:

Why Microsoft Azure?

Azure Storage

Page 39:

No hardware challenges

Page 40:

Deployed in minutes

Page 41:

Mission Critical, Enterprise Ready

Page 42:

Maintenance done for you

Page 43:

Low Cost

$£€¥

*IDC study “The Business Value and TCO Advantage of Apache Hadoop in the Cloud with Microsoft Azure HDInsight”

Page 44:

Introducing Azure HDInsight

Page 45:

Bringing Hadoop to a billion people

Page 46:

Making advanced analytics accessible to Hadoop

Cloud

Page 47:
Page 48:
Page 49:

What Is Hadoop?

Why Deploy To the Cloud?

Microsoft’s Solution

Page 50:

Get Started

http://azure.microsoft.com/en-us/documentation/services/hdinsight/

http://azure.microsoft.com/en-us/documentation/articles/hdinsight-learn-map/

http://www.microsoftvirtualacademy.com/training-courses/getting-started-with-microsoft-big-data

http://channel9.msdn.com/Shows/Data-Exposed

http://azure.microsoft.com/en-us/pricing/free-trial/

Page 51:

© 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.

The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation.

MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.