Top Banner
1 © Copyright 2013 EMC Corporation. All rights reserved. Create Your Big Data Vision And Hadoop-ify Your Data Warehouse Jeff Kelly, Big Data Analyst The Wikibon Project Bill Schmarzo, CTO EIMA Practice, EMC Professional Services
21

Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

Nov 01, 2014

Download

Technology

Join The Wikibon Project’s Big Data Analyst, Jeff Kelly and EMC’s Enterprise Information Management CTO for an engaging discussion on big data use cases and new technologies. These thought leaders will provide practical advice on how you can ensure your big data analytics initiative is focused on the business opportunities that provide the optimal trade-off between business benefit and implementation feasibility. Once you’ve selected the best place for big data, create an architecture that will ensure the successful deployment of technologies such as Hadoop and MPP databases.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

1 © Copyright 2013 EMC Corporation. All rights reserved.

Create Your Big Data Vision And Hadoop-ify Your Data Warehouse

Jeff Kelly, Big Data Analyst

The Wikibon Project

Bill Schmarzo, CTO EIMA Practice, EMC Professional

Services

Page 2: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

2 © Copyright 2013 EMC Corporation. All rights reserved.

Agenda

Current Market Observations

The Big Data Business Maturity Index and How to Identify Your Best Use Case

Get Started With Hadoop and Other New Technologies

What Should You Look For in a Vendor?

Q&A

Page 3: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

3 © Copyright 2013 EMC Corporation. All rights reserved.

Current Market Observations

Jeff Kelly

Page 4: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

4 © Copyright 2013 EMC Corporation. All rights reserved.

Big Data Market Size

2012

$11.4b

2013

$18.2b

2017

$48b

59% Growth Y-o-Y 2011 to 2012

Forecast 60%+ Growth in 2013

31% CAGR Forecast 2012 through 2017

2014 $28b

2015 $37.9b

2016 $43.7b

Source: Wikibon Big Data Vendor Revenue and Market Forecast, 2012-2017

Page 5: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

5 © Copyright 2013 EMC Corporation. All rights reserved.

Big Data Market Segmentation, 2012 Services Leading the Way

Professional Services

$3,784m 34%

Cloud and SaaS $608m

5%

0, 0%

0, 0% 0, 0% 0, 0% 0, 0% Pro. Services

Compute

Storage

Networking

Database

Applications

Data mgt.

Cloud n = $11,400m

Page 6: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

6 © Copyright 2013 EMC Corporation. All rights reserved.

Big Data Growth Drivers

Continued Investment by Web Pioneers and Three Letter Agencies Google alone spent $1b+ on infrastructure in Q4 2012 “Everything we do is a Big Data problem.” – Jay Parikh, VP of Engineering, Facebook CIA CTO Ira Hunt: Our mission is to “collect everything and hang on to it forever”

Increased Awareness and Investments By Large Enterprises Beyond the Web Retailers like Sears leveraging Big Data for price

optimization Financial services firms, including JPMC, Morgan

Stanley and BoA, conduct fraud analysis, risk profiling and more

Pharmaceutical including Bristol Myers Squibb makers use Big Data to support drug development

Page 7: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

7 © Copyright 2013 EMC Corporation. All rights reserved.

Big Data Growth Drivers, Cont.

Increasingly Sophisticated Professional Services Professional services building on experience of assisting early

adopters Some (but not all) are vendor and product agnostic Focusing on identifying use cases, improving communication, and

leveraging existing assets

Technology Maturation Open source community and vendors

making Hadoop enterprise-ready, easier to use

Better integration between Big Data and existing IT infrastructure

Extending Big Data accessibility to business users via BI and data visualization tools.

Consulting

Training & Educations

Integration

Page 8: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

8 © Copyright 2013 EMC Corporation. All rights reserved.

Big Data Growth Inhibitors Lack of Data Scientists and Big Data

Practitioners

Big Data Technology Still Complex, Difficult to Manage/Use

Organizational Resistance to Data-Driven Decision Making

Confusion Due to Vendor Marketing and “Big Data Washing”

Big Data [Your Product Name Here]

Page 9: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

9 © Copyright 2013 EMC Corporation. All rights reserved.

Identify Your Best Big Data Use Case

Bill Schmarzo

CTO, EIM&A Practice EMC Consulting

Page 10: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

10 © Copyright 2013 EMC Corporation. All rights reserved.

Business Metamorphosis

Data Monetization

Business Optimization

Business Insights

Business Monitoring

Big Data Business Model Maturation Index Measures the degree to which the

organization has integrated big data

and advanced analytics into their

business

Page 11: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

11 © Copyright 2013 EMC Corporation. All rights reserved.

Get Started With Hadoop And Other New Technologies

#1) Increase Data Platform Performance With MPP Platform

#2) Embrace Hadoop To Create Next Gen ODS/Data Staging

#3) Leverage Hadoop To Create New Unstructured Data Metrics

#4) Extend Data Warehouse Via Data Virtualization

#5) Deploy In-database Analytics To Accelerate Analytics

Page 12: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

12 © Copyright 2013 EMC Corporation. All rights reserved.

• Massively Parallel Processing (MPP), scale-out architectures provide cost effective options for managing and analyzing massive volumes of structured and unstructured data

• MPP data warehouses provide linear scalability on general purpose, commodity systems

#1) Increase Data Platform Performance With MPP

Page 13: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

13 © Copyright 2013 EMC Corporation. All rights reserved.

Hadoop Data Store Analytics Environment

Data Preparation and Enrichment

ALL data fed into Hadoop Data Store

EDW ETL

Analytic Sandbox

BI Environment

• Production

• Predictable load

• SLA-drive

• Standard tools

• Exploratory, Ad Hoc

• Unpredictable load

• Experimentation

• Best tool for the job

#2) Embrace Hadoop To Create Next Gen ODS

Feeds production BI and Enterprise Data Warehouse environment and high-velocity Analytics Sandbox

Page 14: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

14 © Copyright 2013 EMC Corporation. All rights reserved.

#3) Leverage Hadoop To Create New Metrics Leverage HDFS to provide a single platform that supports your traditional SQL-based BI environment plus your growing unstructured data needs at scale

HDFS

HBase

Pig, Hive, Mahout

Map Reduce

Sqoop Flume

Resource Management & Workflow

Yarn

Zookeeper

Apache

Pivotal HD

Configure,

Deploy,

Monitor,

Manage

Command

Center

Hadoop Virtualization (HVE)

DataLoader

Xtension Framework

Catalog Services

Query Optimizer

Dynamic Pipelining

ANSI SQL + Analytics

HAWQ – Advanced Database Services

Page 15: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

15 © Copyright 2013 EMC Corporation. All rights reserved.

How To Get Started…

Analytics Operationalization

Identify current state, determine required state and conduct gap analysis to develop analytics implementation roadmap

Analytics Lab

Deploy analytics sandbox to quantify the business case

Vision Workshop

Identify big data analytics business use cases

Repeat the process for identified business cases

Page 16: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

16 © Copyright 2013 EMC Corporation. All rights reserved.

What Should You Look For in a Vendor?

Jeff Kelly

Page 17: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

17 © Copyright 2013 EMC Corporation. All rights reserved.

Advice For Selecting Big Data Vendors

Balance short-term goals with long-term vision

Objectives are:

Quick, demonstrable ROI

Sustainable Big Data practice

Don’t get hung up on “speeds and feeds” or feature-by-feature comparisons

Focus on substance, flexibility, commitment and experience

Page 18: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

18 © Copyright 2013 EMC Corporation. All rights reserved.

Selecting Big Data Vendors, Cont.

Evaluate products portfolios based on

Ability to monetize existing and future data assets

Ability to integrate with and compliment existing data management technology

Accessibility to power users and business users alike (depending on use case)

Ability to apply information governance and security best practices

Select service providers with track records of assisting enterprises adopt data-driven culture as well as technology

Page 19: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

19 © Copyright 2013 EMC Corporation. All rights reserved.

To type a question via WebEx, click on the Q&A tab

Please select “Ask: All Panelists”

to ensure your questions reach us. Thank you!

Questions and Answers

Page 20: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

20 © Copyright 2013 EMC Corporation. All rights reserved.

Learn More…

See us at… – EMC World, May 5-9 www.emc.world.com

Contact Jeff Kelly – Email: [email protected] – LinkedIn: http://www.linkedin.com/in/jeffreyfkelly/ – Twitter: @jeffreyfkelly – Research: http://www.wikibon.org/bigdata

Contact Bill Schmarzo – Email: [email protected] – Twitter: @schmarzo – Blogs: Big Data Business Model Maturity Chart

Most Excellent Big Data Strategy Document How We Teach Customers to Use Big Data

Page 21: Create Your Big Data Vision and Hadoop-ify Your Data Warehouse

21 © Copyright 2013 EMC Corporation. All rights reserved.

THANK YOU