Transcript

Big Data Infrastructure Made Easy: BlueData EPIC Integration with Apache Ambari

Overview of Apache Ambari

Apache Ambari is an open source management console for provisioning, managing, and monitoring Hortonworks (HDP) Hadoop clusters.

Ambari provides a single control point for viewing, updating, and managing Hadoop service life cycles, with these important features:

BlueData: Easy, Cost-Effective, On-Demand

[Slide diagram: EPIC Software Platform]

EPIC Software Platform

• ElasticPlane: Self-service, multi-tenant clusters

• DataTap: In-place access to enterprise data stores

• IOBoost: Virtualization with bare-metal performance

Data stores shown: Gluster, HDFS, Swift, NFS

User groups shown: R&D, Manufacturing, Marketing, Sales

Callouts: utilization > 90%; simplified management; no duplication of data; no cluster sprawl; minutes to spin up a virtual cluster

BlueData + Apache Ambari 1.7 Integration

Benefit: Infrastructure agility, elasticity, and efficiency – virtual HDP clusters with the functionality and performance of physical clusters

Features:

• Auto-provisioning of VM hosts with Ambari server and agent components

• Automated, transparent deployment of HDP using REST API for Stacks and Services

Benefit: Time savings for data scientists and Big Data administrators

Features:

• Self-service virtual cluster creation by data scientists or business analysts

• Troubleshooting and management by Big Data admins using Ambari

Benefit: Administrator productivity & flexibility

Features:

• Ambari for monitoring, fine-grained configuration, and enterprise support

Delivering self-service with Ambari

Self-service web interface – define cluster with a few mouse clicks

* Example screenshot from BlueData integration with Apache Ambari

Creating virtual Hadoop clusters with Ambari console within minutes

* Example screenshot from BlueData integration with Apache Ambari

Creating virtual Hadoop clusters within minutes

* Example screenshot from BlueData integration with Apache Ambari

Hadoop cluster provisioning using Ambari API

Design optimized for cluster creation speed and user feedback

Phase 1: VMs

• Self-service request

• VMs provisioned

• Ambari server & agents pre-deployed

• HDFS dependency removed

Phase 2: Core Stack

• Agent registration with server

• REST API call to deploy HDP stack

• REST API to create core-site.xml to use BlueData HDFS abstraction

• Start YARN/MRv2

• Shutdown HDFS service

Phase 3: Services

• Add specific services requested by end user via REST API calls

• Start ‘compute’ services (e.g. Hive, Pig) requested by user

• Update status of cluster
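The REST calls in Phases 2 and 3 might look roughly like the sketch below. It only builds the requests as (method, path, body) tuples and sends nothing, so it can be read and run without an Ambari server. The paths and payload shapes follow the Ambari v1 REST API conventions; the exact sequence BlueData uses is not given in the slides, so the ordering here is an assumption. The function names (`build_plan`, `service_state_request`), the cluster name, the stack version string, and the `dtap://example-datatap/` file system URI are hypothetical placeholders. The sketch also omits steps a real deployment needs, such as creating service components, mapping them to hosts, and installing them before starting.

```python
import json

API = "/api/v1"
# Ambari expects this header on modifying (POST/PUT/DELETE) requests.
HEADERS = {"X-Requested-By": "ambari"}


def service_state_request(cluster, service, state, context):
    """Build a PUT that moves a service to STARTED or INSTALLED (stopped)."""
    return (
        "PUT",
        f"{API}/clusters/{cluster}/services/{service}",
        {
            "RequestInfo": {"context": context},
            "Body": {"ServiceInfo": {"state": state}},
        },
    )


def build_plan(cluster, stack_version, default_fs, compute_services):
    """Return an ordered list of (method, path, body) tuples (assumed order)."""
    plan = []

    # Phase 2: Core Stack
    # List hosts to confirm the pre-deployed agents have registered.
    plan.append(("GET", f"{API}/hosts", None))
    # Create the cluster against the chosen HDP stack version.
    plan.append(
        ("POST", f"{API}/clusters/{cluster}",
         {"Clusters": {"version": stack_version}})
    )
    # Apply a core-site configuration pointing at the storage abstraction.
    plan.append(
        ("PUT", f"{API}/clusters/{cluster}",
         {"Clusters": {"desired_config": {
             "type": "core-site",
             "tag": "version1",
             "properties": {"fs.defaultFS": default_fs},  # placeholder URI
         }}})
    )
    plan.append(service_state_request(cluster, "YARN", "STARTED", "Start YARN"))
    plan.append(service_state_request(cluster, "HDFS", "INSTALLED", "Stop HDFS"))

    # Phase 3: Services requested by the end user
    for service in compute_services:
        plan.append(
            ("POST", f"{API}/clusters/{cluster}/services",
             {"ServiceInfo": {"service_name": service}})
        )
        plan.append(
            service_state_request(cluster, service, "STARTED", f"Start {service}")
        )
    return plan


if __name__ == "__main__":
    # Hypothetical inputs, for illustration only.
    steps = build_plan("demo", "HDP-2.2", "dtap://example-datatap/", ["HIVE", "PIG"])
    for method, path, body in steps:
        print(method, path, json.dumps(body) if body is not None else "")
```

Keeping the plan as plain data makes the ordering easy to inspect or log before any HTTP client sends it, which fits the stated goal of giving users feedback during cluster creation.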

BlueData + Ambari: Big Data Infrastructure Made Easy

• Speed & agility

• Security & control

• Efficiency & lower cost

• 70% savings
