Create Your Big Data Vision And Hadoop-ify Your Data Warehouse
Jeff Kelly, Big Data Analyst, The Wikibon Project
Bill Schmarzo, CTO EIM&A Practice, EMC Professional Services
© Copyright 2013 EMC Corporation. All rights reserved.
Nov 01, 2014
Agenda
Current Market Observations
The Big Data Business Maturity Index and How to Identify Your Best Use Case
Get Started With Hadoop and Other New Technologies
What Should You Look For in a Vendor?
Q&A
Current Market Observations
Jeff Kelly
Big Data Market Size
2012: $11.4b
2013: $18.2b
2014: $28b
2015: $37.9b
2016: $43.7b
2017: $48b
59% Growth Y-o-Y 2011 to 2012
Forecast 60%+ Growth in 2013
31% CAGR Forecast 2012 through 2017
Source: Wikibon Big Data Vendor Revenue and Market Forecast, 2012-2017
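The growth figures on this slide follow from two standard formulas: year-over-year growth and compound annual growth rate (CAGR). The sketch below applies them to the market sizes listed on the slide. The function names are illustrative, and because the slide values are rounded, the computed rates are approximations that need not match the quoted percentages exactly.

```python
def yoy_growth(previous: float, current: float) -> float:
    """Year-over-year growth as a fraction (0.5 means 50%)."""
    return (current - previous) / previous


def cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate between two values `years` apart."""
    return (end_value / start_value) ** (1 / years) - 1


# Market size in $ billions, copied from the slide (rounded figures).
market_size = {
    2012: 11.4,
    2013: 18.2,
    2014: 28.0,
    2015: 37.9,
    2016: 43.7,
    2017: 48.0,
}

growth_2013 = yoy_growth(market_size[2012], market_size[2013])
cagr_2012_2017 = cagr(market_size[2012], market_size[2017], years=2017 - 2012)

if __name__ == "__main__":
    print(f"2012 -> 2013 growth: {growth_2013:.1%}")
    print(f"2012 -> 2017 CAGR:   {cagr_2012_2017:.1%}")
```

CAGR smooths the uneven yearly steps into a single constant rate, which is why it is lower than the early-year growth figures.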
Big Data Market Segmentation, 2012: Services Leading the Way
Professional Services: $3,784m (34%)
Cloud and SaaS: $608m (5%)
Other segments charted (values not recoverable): Compute, Storage, Networking, Database, Applications, Data mgt.
n = $11,400m
Big Data Growth Drivers
Continued Investment by Web Pioneers and Three-Letter Agencies
• Google alone spent $1b+ on infrastructure in Q4 2012
• “Everything we do is a Big Data problem.” – Jay Parikh, VP of Engineering, Facebook
• CIA CTO Ira Hunt: Our mission is to “collect everything and hang on to it forever”
Increased Awareness and Investments by Large Enterprises Beyond the Web
• Retailers like Sears are leveraging Big Data for price optimization
• Financial services firms, including JPMC, Morgan Stanley and BoA, conduct fraud analysis, risk profiling and more
• Pharmaceutical makers, including Bristol Myers Squibb, use Big Data to support drug development
Big Data Growth Drivers, Cont.
Increasingly Sophisticated Professional Services
• Professional services building on experience of assisting early adopters
• Some (but not all) are vendor and product agnostic
• Focusing on identifying use cases, improving communication, and leveraging existing assets
• Service areas shown: Consulting, Training & Education, Integration
Technology Maturation
• Open source community and vendors making Hadoop enterprise-ready, easier to use
• Better integration between Big Data and existing IT infrastructure
• Extending Big Data accessibility to business users via BI and data visualization tools
Big Data Growth Inhibitors
• Lack of Data Scientists and Big Data Practitioners
• Big Data Technology Still Complex, Difficult to Manage/Use
• Organizational Resistance to Data-Driven Decision Making
• Confusion Due to Vendor Marketing and “Big Data Washing” (e.g. “Big Data [Your Product Name Here]”)
Identify Your Best Big Data Use Case
Bill Schmarzo
CTO, EIM&A Practice EMC Consulting
Big Data Business Model Maturation Index
Measures the degree to which the organization has integrated big data and advanced analytics into its business
Stages, from least to most mature:
• Business Monitoring
• Business Insights
• Business Optimization
• Data Monetization
• Business Metamorphosis
Get Started With Hadoop And Other New Technologies
#1) Increase Data Platform Performance With MPP Platform
#2) Embrace Hadoop To Create Next Gen ODS/Data Staging
#3) Leverage Hadoop To Create New Unstructured Data Metrics
#4) Extend Data Warehouse Via Data Virtualization
#5) Deploy In-database Analytics To Accelerate Analytics
#1) Increase Data Platform Performance With MPP
• Massively Parallel Processing (MPP) scale-out architectures provide cost-effective options for managing and analyzing massive volumes of structured and unstructured data
• MPP data warehouses provide linear scalability on general-purpose commodity systems
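The scale-out idea behind MPP can be illustrated with a small scatter-gather sketch: rows are distributed across independent segments, each segment aggregates only its own rows, and a coordinator merges the partial results. This is a toy model under stated assumptions, not how any particular MPP product is implemented. The function names and sample data are hypothetical, and threads stand in for what would be separate nodes in a real cluster.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(rows, n_segments):
    """Distribute rows across segments by hashing the key (illustrative scheme)."""
    segments = [[] for _ in range(n_segments)]
    for key, amount in rows:
        segments[hash(key) % n_segments].append((key, amount))
    return segments


def local_aggregate(segment):
    """Each segment sums only its own rows; nothing is shared between segments."""
    totals = {}
    for key, amount in segment:
        totals[key] = totals.get(key, 0) + amount
    return totals


def merge(partials):
    """Coordinator step: combine the per-segment partial totals."""
    totals = {}
    for partial in partials:
        for key, amount in partial.items():
            totals[key] = totals.get(key, 0) + amount
    return totals


def mpp_sum_by_key(rows, n_segments=4):
    """Scatter rows to segments, aggregate in parallel, then gather."""
    segments = partition(rows, n_segments)
    with ThreadPoolExecutor(max_workers=n_segments) as pool:
        partials = list(pool.map(local_aggregate, segments))
    return merge(partials)


# Hypothetical sample data: (region, sale amount).
sales = [("east", 10), ("west", 5), ("east", 7), ("north", 3), ("west", 1)]
totals = mpp_sum_by_key(sales)

if __name__ == "__main__":
    print(totals)
```

Because each segment works on its own slice of the data, adding segments spreads the same work over more workers, which is the intuition behind the scalability claim above.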
#2) Embrace Hadoop To Create Next Gen ODS
Hadoop Data Store feeds the production BI and Enterprise Data Warehouse environment and the high-velocity Analytics Sandbox
ALL data fed into Hadoop Data Store (Data Preparation and Enrichment)
BI Environment (EDW, fed via ETL)
• Production
• Predictable load
• SLA-driven
• Standard tools
Analytics Environment (Analytic Sandbox)
• Exploratory, Ad Hoc
• Unpredictable load
• Experimentation
• Best tool for the job
#3) Leverage Hadoop To Create New Metrics
Leverage HDFS to provide a single platform that supports your traditional SQL-based BI environment plus your growing unstructured data needs at scale
Pivotal HD stack (from diagram):
• Apache: HDFS; HBase; Pig, Hive, Mahout; MapReduce; Sqoop, Flume; Resource Management & Workflow (YARN, ZooKeeper)
• Pivotal HD: Command Center (Configure, Deploy, Monitor, Manage); Hadoop Virtualization (HVE); DataLoader
• HAWQ – Advanced Database Services: Xtension Framework; Catalog Services; Query Optimizer; Dynamic Pipelining; ANSI SQL + Analytics
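To make the idea of deriving a new metric from unstructured data concrete, the sketch below imitates the map, shuffle, and reduce phases in plain Python to count mentions of tracked terms in free-text customer comments. It does not use the Hadoop API. The tracked terms, the sample comments, and all function names are hypothetical and chosen only for illustration.

```python
import re
from collections import defaultdict

# Hypothetical terms a business might want to track in customer comments.
TRACKED_TERMS = {"refund", "late", "broken"}


def map_phase(record):
    """Emit (term, 1) for each tracked term found in one free-text record."""
    words = re.findall(r"[a-z']+", record.lower())
    return [(word, 1) for word in words if word in TRACKED_TERMS]


def shuffle(pairs):
    """Group emitted values by key, as the framework would between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Sum the counts for one term."""
    return key, sum(values)


def mention_counts(records):
    """Run map, shuffle, and reduce over all records in a single process."""
    pairs = [pair for record in records for pair in map_phase(record)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


# Hypothetical unstructured input.
comments = [
    "Package arrived late and the box was broken.",
    "Late again! I want a refund.",
    "Great service, no complaints.",
]
metric = mention_counts(comments)

if __name__ == "__main__":
    print(metric)
```

On a real cluster the map and reduce steps would run in parallel across data blocks stored in HDFS; the single-process version only shows the shape of the computation.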
How To Get Started…
• Vision Workshop: Identify big data analytics business use cases
• Analytics Lab: Deploy analytics sandbox to quantify the business case
• Analytics Operationalization: Identify current state, determine required state and conduct gap analysis to develop analytics implementation roadmap
Repeat the process for identified business cases
What Should You Look For in a Vendor?
Jeff Kelly
Advice For Selecting Big Data Vendors
• Balance short-term goals with long-term vision. Objectives are:
– Quick, demonstrable ROI
– Sustainable Big Data practice
• Don’t get hung up on “speeds and feeds” or feature-by-feature comparisons
• Focus on substance, flexibility, commitment and experience
Selecting Big Data Vendors, Cont.
Evaluate product portfolios based on:
• Ability to monetize existing and future data assets
• Ability to integrate with and complement existing data management technology
• Accessibility to power users and business users alike (depending on use case)
• Ability to apply information governance and security best practices
Select service providers with track records of helping enterprises adopt a data-driven culture as well as technology
Questions and Answers
To type a question via WebEx, click on the Q&A tab. Please select “Ask: All Panelists” to ensure your questions reach us. Thank you!
Learn More…
See us at…
– EMC World, May 5-9, www.emc.world.com
Contact Jeff Kelly
– Email: [email protected]
– LinkedIn: http://www.linkedin.com/in/jeffreyfkelly/
– Twitter: @jeffreyfkelly
– Research: http://www.wikibon.org/bigdata
Contact Bill Schmarzo
– Email: [email protected]
– Twitter: @schmarzo
– Blogs: Big Data Business Model Maturity Chart; Most Excellent Big Data Strategy Document; How We Teach Customers to Use Big Data
THANK YOU