Top Banner
Grab some coee and enjoy the pre-show banter before the top of the hour!
38

The Big Picture: Understanding the Many Roles of Hadoop

Jul 16, 2015

Download

Technology

Inside Analysis
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: The Big Picture: Understanding the Many Roles of Hadoop

Grab some

coffee and

enjoy the

pre-show

banter before

the top of the

hour!

Page 2: The Big Picture: Understanding the Many Roles of Hadoop

The Big Picture: Understanding the Many Roles of Hadoop Exploratory Webcast | January 28, 2015

SPONSORED BY

Page 3: The Big Picture: Understanding the Many Roles of Hadoop

Guests

Robin Bloor Chief Analyst, The Bloor Group @robinbloor [email protected]

Eric Kavanagh CEO, The Bloor Group @eric_kavanagh [email protected]

Page 4: The Big Picture: Understanding the Many Roles of Hadoop

Findings Webcast May 27, 2015

Making Sense of Hadoop

Roundtable Webcast March 18, 2015

Exploratory Webcast January 28, 2015

#Hadoop

Page 5: The Big Picture: Understanding the Many Roles of Hadoop
Page 6: The Big Picture: Understanding the Many Roles of Hadoop

Making Sense of Hadoop

Robin Bloor, PhD

Page 7: The Big Picture: Understanding the Many Roles of Hadoop

In Three Segments

The Forces of Disruption

Focus Areas

Hadoop: Then, Now & Later

PART ONE

PART THREE

PART TWO

Page 8: The Big Picture: Understanding the Many Roles of Hadoop

The Forces of Disruption

The Forces Of Disruption

Page 9: The Big Picture: Understanding the Many Roles of Hadoop

The Generic Dimensions of IT

u  All IT involves 4 components (only) •  Users •  Software •  Data •  Hardware

u  Change any one of these and the other three components have to adjust

u  Aggregate these and you get a process

u  Time will impose change anyway

u  We can also consider a larger field, since this applies to all systems not just IT systems

Four Fundamental (IT) Factors

Hardware

Users

Software Data

Business

InformationB

usinessProcess

Hum

anActivity

AllInform

ation

Staff

Facility

People

Civilization

TIME

Page 10: The Big Picture: Understanding the Many Roles of Hadoop

The Hexagon of Business Change

u  Speed •  Speed of action •  Speed of business process

u  Cost •  Cost of acquisition •  Cost of ownership

u  Time •  Time to deploy •  Time to employ

u  Business Value •  By competitiveness •  By cost reduction

u  Effort •  Effort to develop •  Effort to deploy

u  Fit •  Compatible •  Incompatible

Plus, capacity to change

SPEED

TIMETAKEN

EFFORT

FIT

VALUE

Speed of ProcessSpeed of Action

CompatibleIncompatible

TimetoDeploy

TimetoEmploy

Cost Reduction

Competitiveness

AcquisitionCost

TCOCOST

Effort toDeploy

Effort toDevelop

Hexagon ofChange Factors

Plus Capacity

Page 11: The Big Picture: Understanding the Many Roles of Hadoop

The Technology Layers

u  The buying impulse descends through the stack

u  The impact of technology change rises up the stack

u  This ensures the eventual “legacification” of all technology

The BuyingImpulse Goes

Down

TechnologyChange Rises Up

The TechnologyLayers

Page 12: The Big Picture: Understanding the Many Roles of Hadoop

Technology Layer Perspectives

u  This simple model has a number of uses

u  For example, we can use it to depict the “aaS options”

u  More importantly we can use it to track disruption …

u  More of which later…

The aaS Possibilities

Page 13: The Big Picture: Understanding the Many Roles of Hadoop

Disruption in The Technology Layers

u  Disruption (as innovation) can happen in any layer

u  Where it occurs, it will impact all layers above it

u  And it may also impact the layers below it (but less quickly)

u  There is no such thing as future-proof; but some technologies definitely live longer

The BuyingImpulse Goes

Down

TechnologyChange Rises Up

The TechnologyLayers

Page 14: The Big Picture: Understanding the Many Roles of Hadoop

Mainframe Computer (Batch architecture)

On-line Interaction (Centralized architecture)

PC (Client server)

Internet (Multi-tier architecture)

Mobile (Service oriented architecture)

Internet of things (Event driven architecture)

Tech Revolutions

Note that all of these disruptive changes were driven by hardware innovation

Page 15: The Big Picture: Understanding the Many Roles of Hadoop

Hardware Layer Disruption

u  SSD is now on the Moore’s Law curve

u  Spinning disk has almost popped its clogs

u  Memory grows and can be networked

u  CPUs still evolving: CPU and GPU merged

Page 16: The Big Picture: Understanding the Many Roles of Hadoop

Hierarchical Memory

u  On chip speed v RAM •  L1(32K) = 100x •  L2(246K) = 30x •  L3(8-20Mb) = 8.6x

u  RAM v SSD •  RAM = 300x

u  SSD v Disk •  SSD = 10x

Note: Vector instructions and data compression

Page 17: The Big Picture: Understanding the Many Roles of Hadoop

In-Memory Disruption

u  Memory will become the primary store for data (this impacts data flows)

u  Almost all applications are poorly built for this

u  Memory is an accelerator, as is CPU cache – this is becoming a factor

u  HP’s Memristor waits in the wings

Page 18: The Big Picture: Understanding the Many Roles of Hadoop

Hadoop: Deceptive Impression Because Hadoop was built to run on 1000s of servers, there’s an impression that Hadoop needs such huge

clusters/grids

In reality the opposite is now happening, the number of servers is diminishing, which means that Moore’s Law still

operates

Page 19: The Big Picture: Understanding the Many Roles of Hadoop

Putting a SoC in IT

u  It’s possible that the CPU-Memory split will vanish, possibly soon

u  This requires the emergence of the commodity SoC

u  There are already SoCs that run Linux

u  Grids of SoCs would replace grids of servers

Page 20: The Big Picture: Understanding the Many Roles of Hadoop

Parallelism: The Imp Is Out of the Bottle

u Multicore chips enabled parallelism

u  It has changed the whole performance equation

u  It enabled Big Data

u  Big Data is really Big Processing

Page 21: The Big Picture: Understanding the Many Roles of Hadoop

Some Architectural Principles

u  The new atom of data is the event

u  SUSO, scale up before scale out

u  Take the processing to the data, if you can

u  Hadoop is a component not a solution

Page 22: The Big Picture: Understanding the Many Roles of Hadoop

Hadoop: Then, Now & Later

Page 23: The Big Picture: Understanding the Many Roles of Hadoop

The Hadoop Ecosystem

u  Apache Projects: HBase, HCatalog, Pig, Hive, Flume, Storm, Sqoop, Nutch, Avro, Oozie, ZooKeeper, etc.

u  New commercial products: Actian, RedPoint, Attunity, Voltage Security, etc.

u  Languages and dev environments

Page 24: The Big Picture: Understanding the Many Roles of Hadoop

Hadoop Usage

u  Data archive

u  Data staging & ETL

u  Data preparation

u  Analytics sandbox

u  Analytics platform

u  Database environment

Page 25: The Big Picture: Understanding the Many Roles of Hadoop

The State of Play

Graphic from Allied Market Research

There are some straws in the wind here: it is being used everywhere

It’s a HUGE market Also VC investment is massive

Page 26: The Big Picture: Understanding the Many Roles of Hadoop

Hadoop as a Clip-On

Page 27: The Big Picture: Understanding the Many Roles of Hadoop

Data Lake, Refinery, Hub: In Overview

Think Logical, Implement Physical

Page 28: The Big Picture: Understanding the Many Roles of Hadoop

Two Data Flows

Page 29: The Big Picture: Understanding the Many Roles of Hadoop

Hadoop in the Technology Layers

Hadoop in theTechnology Layersu  Hadoop starts as a scale-

out file system with a one-dimensional development environment

u  It evolves with the addition of YARN to begin to occupy the OS & Sys Mgt layer

u  Analytics applications become synonymous with Hadoop

u  Hadoop is migrating through the stack

Page 30: The Big Picture: Understanding the Many Roles of Hadoop

Hadoop as an OS

u  The Trail of OSes •  OS/360 -> OS/370 -> z/OS •  VMS •  Unix -> Solaris •  MS-DOS -> Windows •  Linux •  OS X -> iOS

u  OSes evolve in two ways •  Own development •  Third party add-ons

u  They create application ecosystems u  In time they make previous OSes

obsolete u  This is what Hadoop is in the

process of doing

Page 31: The Big Picture: Understanding the Many Roles of Hadoop

Focus Areas

Page 32: The Big Picture: Understanding the Many Roles of Hadoop

Hadoop in the Basic Map

u  Hardware & Cloud

u  Software

u  Data realities

u  Usage

Four Fundamental (IT) Factors

Hardware

Users

Software Data

Business

InformationB

usinessProcess

Hum

anActivity

AllInform

ation

Staff

Facility

People

Civilization

TIME

Page 33: The Big Picture: Understanding the Many Roles of Hadoop

Hadoop in the Hexagon

Plus, capacity to change

SPEED

TIMETAKEN

EFFORT

FIT

VALUE

Speed of ProcessSpeed of Action

CompatibleIncompatible

TimetoDeploy

TimetoEmploy

Cost Reduction

Competitiveness

AcquisitionCost

TCOCOST

Effort toDeploy

Effort toDevelop

Hexagon ofChange Factors

Plus Capacity

u  Speed •  Speed of action •  Speed of business process

u  Cost •  Cost of acquisition •  Cost of ownership

u  Time •  Time to deploy •  Time to employ

u  Business Value •  By competitiveness •  By cost reduction

u  Effort •  Effort to develop •  Effort to deploy

u  Fit •  Compatible •  Incompatible

Page 34: The Big Picture: Understanding the Many Roles of Hadoop

Hadoop in the Technology Layers

Hadoop in theTechnology Layers

u  Speed •  Speed of action •  Speed of business process

u  Cost •  Cost of acquisition •  Cost of ownership

u  Time •  Time to deploy •  Time to employ

u  Business Value •  By competitiveness •  By cost reduction

u  Effort •  Effort to develop •  Effort to deploy

u  Fit •  Compatible •  Incompatible

Page 35: The Big Picture: Understanding the Many Roles of Hadoop

In Three Segments

Part 1 - The Forces of Disruption

Part 2 – Hadoop: Then, Now & Later

Part 3 – Focus Areas

Page 36: The Big Picture: Understanding the Many Roles of Hadoop
Page 37: The Big Picture: Understanding the Many Roles of Hadoop

Questions?

#Hadoop or

USE THE Q&A

Page 38: The Big Picture: Understanding the Many Roles of Hadoop

THANK YOU!

FIND OUT MORE at http://insideanalysis.com/research/making-sense-of-hadoop