Top Banner
The Infochimps Big Data Cloud Faster and Smarter Decision-Making 30 days from critical business problems to impactful insight. Our managed Big Data Platform-as-a-Service Cloud with proven application developer tools and infrastructure remove risk, accelerate deployment, and streamline your Big Data projects- enabling you to quickly start gaining insights, then scale to more data and use cases as you go.
21

Infochimps #1 Big Data Platform for the Cloud

Nov 01, 2014

Download

Documents

Brian Krpec

The Infochimps Platform is the simplest, fastest, and most flexible way to implement proven big data infrastructure in the cloud. Scalably and affordably ingest data from wherever you need — your in-house systems, external data feeds, data from the web, or our Data Marketplace. Make it useful with in-stream data decoration and augmentation. Store and analyze it in the best place for your application. Hadoop, NoSQL, real-time analytics — how do you tie it all together? The Infochimps Platform takes the mystery and difficulty out of big data and seamlessly integrates it with your existing environment, so you can focus on gaining business insights from your data fast.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Infochimps #1 Big Data Platform for the Cloud

The Infochimps Big Data Cloud! Faster and Smarter Decision-Making!

30 days from critical business problems to impactful insight. Our managed Big Data Platform-as-a-Service Cloud with proven application developer tools and infrastructure remove risk, accelerate deployment, and streamline your Big Data

projects- enabling you to quickly start gaining insights, then scale to more data and use cases as you go.

Page 2: Infochimps #1 Big Data Platform for the Cloud

big data cloud

+

=

Key Benefits !Fast!It only takes a few hours to deploy a complete solution to a public cloud or your private enterprise cloud. This means you can achieve immediate insights without sacrificing custom development ability.!

Simple!It shouldn‘t take a rocket scientist to tap into the insights Big Data can provide. We’ve created analytic services and application developer frameworks that make interacting with Big Data systems much easier by letting you use languages already familiar to you.!

Flexible!Our comprehensive architecture means you can combine real-time, ad-hoc, and batch analytics depending on your application needs. You can also start your system at the size that’s right for you, and grow it over time to additional data and use cases as your business evolves.!

Enterprise Ready!We reduce risk with the stability of our managed platform, our firm stance on data security, and our compatibility with many public, private, and hybrid cloud environments.!

2!

Critical Business Problems

Impactful Business Insights

Page 3: Infochimps #1 Big Data Platform for the Cloud

3!

Big Data Drivers!§  The proliferation of data

capture and creation technologies

§  Increased “interconnectedness” drives consumption (creating more data)

§  Inexpensive storage makes it possible to keep more, longer

§  Innovative software and analysis tools turn data into information

More Devices!

More Consumption!

More Content!

New & Better

Information!

Big Data encompasses not only the content itself, but how it’s analyzed and consumed.

§  Every gigabyte of stored content can generate a petabyte or more of transient data*

§  The information about you is much greater than the information you create

*Source: IDC 2011

Page 4: Infochimps #1 Big Data Platform for the Cloud

4!

Our Customers & Use Cases!

4!

Customer Segmentation Cisco is processing 100s of terabytes of weblog data to segment customers downloading software from their support portal by product, geography, and industry.

Social Media Listening Infomart built a brand new social media listening platform consuming100s of millions of messages from a variety of social networks in real-time, adding custom influence and authority scores, and building a simple front-end on top of Elasticsearch’s powerful API.

Mission Critical Data Pipeline Spongecell’s ad network produces over 10,000+ events per second and lost data means lost revenue. They built a robust, loss-free, high-volume data pipeline that processes all their events meaning they never worry about their data again.

Retail Analytics Koupon helps their large retail customers run marketing campaigns around mobile coupons. They collect data from mobile devices and add context around demographics and geolocation to provide their customers with in-depth insight about their customers.

Page 5: Infochimps #1 Big Data Platform for the Cloud

5!

Big Data Cloud Services: Overview!

5!

Data Integration and Real-Time Analytics Ad-Hoc Query and Near-Real-Time Analytics Batch Analytics

Page 6: Infochimps #1 Big Data Platform for the Cloud

6!

Big Data Cloud Services: Data Flow!

6!

Page 7: Infochimps #1 Big Data Platform for the Cloud

7!

Social Media Listening Platform!

7!

•  Sentiment Analysis •  Authority Scoring •  Influencer Ranking •  Gender Classifier

Analytics!

Application!

Page 8: Infochimps #1 Big Data Platform for the Cloud

8!

Ironfan™!

8!

!Ironfan is a systems provisioning, deployment, and updating tool. Ironfan automates not only machine configuration, but entire systems configuration to enable the complete Big Data stack, including data integration, routing, storage, computation, monitoring, and more.!!1.  Cycle time goes from weeks to minutes!2.  Service discovery means your machines auto-

wire themselves together!3.  Infrastructure-as-Code provides a simple,

iterative, testable contract for how your system will function!

4.  Leverages a combination of proprietary and open source code, including Chef and Fog!

Foundation for Your Big Data Services!

Page 9: Infochimps #1 Big Data Platform for the Cloud

9!

Data Delivery Service™!

9!

!Data Delivery Service™ (DDS) integrates seamlessly with your existing environment, provides highly scalable ETL (extract-transform-load) capabilities, and enables real-time, streaming data analytics.!!DDS™ gives you scalability & flexibility!!§  Tap into virtually any data source!

§  Internal!§  External!

§  Real-Time Stream Processing!§  Ingestions!§  Analytics!

§  Make Well-Informed Business Decisions!§  On-the-fly queries!

Data Integration & Real-Time Analytics!

Page 10: Infochimps #1 Big Data Platform for the Cloud

10!

Database Management!

10!

Ad-Hoc Query & Analytics!Whether it's HBase, Cassandra, Elasticsearch, MongoDB, MySQL, or others, we ensure the right data storage for the job is always right at your fingertips.!Database management gives you peace-of-mind!§  Databases and data storage, as a service. We are your

outsourced “Big Data” database administrator (DBA), providing !§  Database maintenance!§  Updates!§  Support!

§  Database Agnostics!§  Amazon S3!§  HBase!§  Cassandra!§  Elasticsearch!§  MongoDB!§  MySQL!§  + Many More!

§  Deploy to your internal cloud or to a public cloud!

Page 11: Infochimps #1 Big Data Platform for the Cloud

11!

Cloud Hadoop!

11!

Batch Analytics!Perform large-scale batch analysis as you need it, whether ad-hoc Hadoop clusters or always-on production workflows. Access all the tools you need, with on-demand scaling and tuning.!

Cloud Hadoop gives you cloud elasticity & efficiency!§  Turn clusters on at a moment’s notice!§  Scale and customize on the fly!§  Leverage tools that make Hadoop easier!

§  Wukong™!§  Pig!§  Hive!

§  Leverage tools that extend Hadoop!§  Azkaban!§  Sqoop!§  + more!

Video: Hadoop Cluster ! in 20 Minutes!

Page 12: Infochimps #1 Big Data Platform for the Cloud

12!

Wukong™!Simplified Scripts for Analytics!Wukong™ provides a simplified analytics scripting experience. Write your analytics in developer-friendly Ruby, run code locally for faster development cycles, and leverage existing analytics scripts.!Wukong™ gives you Superpowers!!§  Ruby for Big Data Analytics - That means you can use a familiar,

fun programming language to do both Hadoop jobs and DDS™ algorithms.!

!§  Quickly Iterate - Rather than developing and testing everything on

your production Hadoop and DDS™ clusters, you can develop scripts locally on your laptop.!

§  Leverage Familiar Standard-In/Standard-Out Language - Wukong™ can leverage your existing standard-in/standard-out code with Big Data.!

12!

Page 13: Infochimps #1 Big Data Platform for the Cloud

13!

Dashpot™!

13!

Reporting & Systems Management!Dashpot™ is a lightweight analytics and operations dashboard for administrators & developers!

Dashpot™ gives you visibility and control!!§  Real-Time visualizations from streaming data!§  Deep Visibility !

§  Individual Machines!§  Overall Systems!

§  Quickly Start & Stop functional units in your data clusters!

Page 14: Infochimps #1 Big Data Platform for the Cloud

14!

Platform API!

14!

Custom Applications and Dashboards!With a unified API, control of the platform and visibility of the data within it are just a few web requests away. !!

The Platform API gives you fine-grain control!!!§ HTTP-based API!

§  Simple JSON commands!

§ Access data through a simple, unified endpoint!§ Manage Platform Configuration Settings!

Page 15: Infochimps #1 Big Data Platform for the Cloud

Analytics

Bringing Big Data Analytics To Your Enterprise Data

big data cloud

15!

Page 16: Infochimps #1 Big Data Platform for the Cloud

Big Data Cloud

•  24 Month Project •  $1M for 10TB •  Analyzing 15% of

Enterprise Data

Traditional Data Infrastructure

Big Data Infrastructure

•  12 Month Project •  $300K for 10TB •  Analyzing up to 100% of

Enterprise Data

•  1 Month Project •  $10K / month for 10TB •  Analyzing up to 100% of

Enterprise Data + 15,000+ external sources

big data cloud

16!

Traditional vs. DIY vs. Infochimps!

Page 17: Infochimps #1 Big Data Platform for the Cloud

Data Center Infrastructure!‣  Lights Out Data Center!‣  Global Footprint!‣  Co-located with Data!‣  99.95 - 99.995% SLA!

17!

Cloud Delivery!

Page 18: Infochimps #1 Big Data Platform for the Cloud

Business Intelligence!‣  Visualize your data!‣  Business Reporting!‣  Application Integration!‣  Integrated with the Cloud!

18!

Cloud Delivery!

Page 19: Infochimps #1 Big Data Platform for the Cloud

Professional Services!‣  Big Data Planning!‣  Data Modeling!‣  Analytics!‣  Architecture/Design!‣  Implementation!

19!

Cloud Delivery!

Page 20: Infochimps #1 Big Data Platform for the Cloud

Infochimps Engagement Model!

Identify first use case, create proposal, design workflows and

iterate on architecture locally

Deploy initial design to development & staging cloud,

iteratively add functionality

Deploy to production public or private cloud

and scale out

20!

Page 21: Infochimps #1 Big Data Platform for the Cloud

Contact Information!

21!

Brian Krpec!Director of Sales 512-709-4704 cell [email protected] @bkrpec