BlueData Integration with Apache Ambari
Post on 08-Aug-2015
Big Data Infrastructure Made Easy
BlueData EPIC Integration with Apache Ambari
Overview of Apache Ambari
Apache Ambari is an open source management console for provisioning, managing, and monitoring Hortonworks (HDP) Hadoop clusters.
Ambari provides a single control point for viewing, updating, and managing Hadoop service life cycles, with these important features:
BlueData: Easy, Cost-Effective, On-Demand
EPIC Software Platform
• ElasticPlane: Self-service, multi-tenant clusters
• DataTap: In-place access to enterprise data stores
• IOBoost: Virtualization with bare-metal performance
Diagram labels (storage): Gluster, HDFS, Swift, NFS
Diagram labels (groups): R&D, Manufacturing, Marketing, Sales
Diagram callouts:
• Minutes to spin up a virtual cluster
• Utilization > 90%
• Simplified management
• No duplication of data
• No cluster sprawl
BlueData + Apache Ambari 1.7 Integration
Benefits and Features
Benefit: Infrastructure agility, elasticity, and efficiency: virtual HDP clusters with the functionality and performance of physical clusters
• Auto-provisioning of VM hosts with Ambari server and agent components
• Automated, transparent deployment of HDP using REST API for Stacks and Services
Benefit: Time savings for data scientists and Big Data administrators
• Self-service virtual cluster creation by data scientists or business analysts
• Troubleshooting and management by Big Data admins using Ambari
Benefit: Administrator productivity & flexibility
• Ambari for monitoring, fine-grained configuration, and enterprise support
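The Stacks and Services REST calls mentioned above can be sketched as follows. This is a minimal illustration, not BlueData's implementation: the helper names, the cluster name `demo`, and the stack version `2.2` are assumptions made for the example, and the endpoint paths follow the public Ambari REST API layout (`/api/v1/stacks/...` and `/api/v1/clusters/.../services`), which should be checked against the Ambari release in use. The code only builds request descriptions; it does not contact a server.

```python
import json
from typing import NamedTuple, Optional

API_ROOT = "/api/v1"  # Ambari REST API root


class AmbariRequest(NamedTuple):
    """A request description: nothing is sent over the network here."""
    method: str
    path: str
    body: Optional[str]  # JSON-encoded payload, or None for GET


def list_stack_services(stack: str, version: str) -> AmbariRequest:
    """Describe the GET that lists the services a stack version defines."""
    path = f"{API_ROOT}/stacks/{stack}/versions/{version}/services"
    return AmbariRequest("GET", path, None)


def add_service(cluster: str, service: str) -> AmbariRequest:
    """Describe the POST that adds a service (e.g. HIVE) to a cluster."""
    payload = {"ServiceInfo": {"service_name": service}}
    path = f"{API_ROOT}/clusters/{cluster}/services"
    return AmbariRequest("POST", path, json.dumps(payload))


if __name__ == "__main__":
    # "demo" and "2.2" are hypothetical values for illustration only.
    for req in (list_stack_services("HDP", "2.2"), add_service("demo", "HIVE")):
        print(req.method, req.path, req.body or "")
```

Each description can be sent with any HTTP client. Ambari expects an `X-Requested-By` header on modifying requests (POST, PUT, DELETE), along with admin credentials.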
Delivering self-service with Ambari
Self-service web interface: define a cluster with a few mouse clicks
* Example screenshot from BlueData integration with Apache Ambari
Delivering self-service with Ambari
Creating virtual Hadoop clusters with the Ambari console within minutes
* Example screenshot from BlueData integration with Apache Ambari
Delivering self-service with Ambari
Creating virtual Hadoop clusters within minutes
* Example screenshot from BlueData integration with Apache Ambari
Delivering self-service with Ambari
Hadoop cluster provisioning using Ambari API
Design optimized for cluster creation speed and user feedback
Phase 1: VMs
• Self-service request
• VMs provisioned
• Ambari server & agents pre-deployed
• HDFS dependency removed
Phase 2: Core Stack
• Agent registration with server
• REST API call to deploy HDP stack
• REST API to create core-site.xml to use BlueData HDFS abstraction
• Start YARN/MRv2
• Shutdown HDFS service
Phase 3: Services
• Add specific services requested by end user via REST API calls
• Start ‘compute’ services (e.g. Hive, Pig) requested by user
• Update status of cluster
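The ordering of the Phase 2 and Phase 3 steps above can be sketched as a list of REST calls. This is a hedged sketch under stated assumptions, not the integration's actual code: the function names, the config tag `version1`, and the `fs.defaultFS` value are hypothetical, and the request shapes follow the public Ambari REST API as commonly documented (a service is running in state `STARTED` and stopped in state `INSTALLED`). Steps a real deployment also needs, such as mapping components to hosts and installing a service before starting it, are omitted for brevity.

```python
import json
from typing import List, Tuple

Step = Tuple[str, str, str]  # (HTTP method, path, JSON body)


def set_service_state(cluster: str, service: str, state: str, context: str) -> Step:
    """PUT that moves a service to STARTED (running) or INSTALLED (stopped)."""
    body = {"RequestInfo": {"context": context},
            "Body": {"ServiceInfo": {"state": state}}}
    return ("PUT", f"/api/v1/clusters/{cluster}/services/{service}", json.dumps(body))


def set_core_site(cluster: str, tag: str, default_fs: str) -> Step:
    """PUT a core-site config whose default filesystem points at external storage."""
    body = {"Clusters": {"desired_config": {
        "type": "core-site", "tag": tag,
        "properties": {"fs.defaultFS": default_fs}}}}
    return ("PUT", f"/api/v1/clusters/{cluster}", json.dumps(body))


def plan(cluster: str, default_fs: str, compute_services: List[str]) -> List[Step]:
    """Order the calls: core-site first, then YARN/MRv2 up, HDFS down, then services."""
    steps = [
        set_core_site(cluster, "version1", default_fs),  # tag is an assumed value
        set_service_state(cluster, "YARN", "STARTED", "Start YARN"),
        set_service_state(cluster, "MAPREDUCE2", "STARTED", "Start MRv2"),
        set_service_state(cluster, "HDFS", "INSTALLED", "Stop HDFS"),
    ]
    for svc in compute_services:
        add_body = json.dumps({"ServiceInfo": {"service_name": svc}})
        steps.append(("POST", f"/api/v1/clusters/{cluster}/services", add_body))
        steps.append(set_service_state(cluster, svc, "STARTED", f"Start {svc}"))
    return steps


if __name__ == "__main__":
    # Cluster name and filesystem URI are placeholders for illustration.
    for method, path, body in plan("demo", "example://store/", ["HIVE", "PIG"]):
        print(method, path, body)
```

Keeping the plan as plain data makes the sequence easy to log step by step, which fits the slide's stated goal of giving the user feedback during cluster creation.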
BlueData + Ambari: Big Data Infrastructure Made Easy
• Speed & agility
• Security & control
• Efficiency & lower cost
• 70% savings