Top Banner
Hadoop and Microsoft. d Sarsfield | Senior Software Engineer @bradoop
12

Hadoop acm presentation

May 28, 2015

Download

Documents

Brad Sarsfield

Microsoft Hadoop presentation for ACM Data Mining Hackathon competition.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Hadoop acm presentation

Hadoop and Microsoft.

Brad Sarsfield | Senior Software Engineer @bradoop

Page 2: Hadoop acm presentation

BIG DATA

HADOOP

MICROSOFT & HADOOP

Agenda

Page 3: Hadoop acm presentation

How Big is Big Data?

Page 4: Hadoop acm presentation

It’s all about your BigDataProblemsBig Problems

Page 5: Hadoop acm presentation

Hadoop is for Big Data.

Page 6: Hadoop acm presentation

Data is the Platform.

Page 7: Hadoop acm presentation

Hadoop Data Science.

Page 8: Hadoop acm presentation

Hadoop Capabilities.

Machine Learning

Graph Processing

Distributed Compute

Extract Load Transform

Predictive

Analysis

Page 9: Hadoop acm presentation

Distributed Storage(HDFS)

Query(Hive)

Hadoop architecture.

Distributed Processing(Map Reduce)

Scripting

(Pig)

NoSQ

L Data

base

(HB

ase

)

Metadata(HCatalog)

Data

Inte

gra

tion

( OD

BC

/ SQ

OO

P/

REST)

Busin

ess In

tellig

ence

(E

xcel, Po

werV

iew

…)

Machine Learning(Mahout)

Graph(Pegasus)

Stats processin

g(RHadoop

)

Pipelin

e /

workflo

w(O

ozie

)

Log file

aggre

gatio

n(Flu

me)

Page 10: Hadoop acm presentation

Hadoop and Microsoft.

We are delivering• Apache Hadoop on Windows Server• Apache Hadoop on Windows Azure

Big engineering investment• Big Data Business Intelligence tooling• Big Data Apache Hadoop• Big Data Parallel Data Warehouse

Open source Commitment• Apache Software Foundation• Hortonworks Partnership

Page 11: Hadoop acm presentation

Microsoft Hadoop Vision.

Microsoft Business Intelligence (BI) • ODBC Connectivity

Better on Windows and Azure • Active Directory• System Center

Microsoft Data Connectivity• SQL Server / SQL Parallel Data Warehouse• Azure Storage / Azure Data Market

Page 12: Hadoop acm presentation

ACM Hackathon.

Hadoop on Azure demo

Free Hadoop on Azure• Code: acmhackathon

Free 30 day Azure account • No credit card• 750h small compute / 35GB storage• Email [email protected] for code