Hops, the world's fastest Hadoop distribution. JFokus, Feb 6th 2018. Jim Dowling, Assoc Prof @ KTH, Senior Researcher @ RISE SICS, CEO @ Logical Clocks AB. www.hops.io @hopshadoop

Apr 20, 2018

Transcript
Page 1:

Hops, the world's fastest Hadoop distribution

JFokus, Feb 6th 2018

Jim Dowling
Assoc Prof @ KTH
Senior Researcher @ RISE SICS
CEO @ Logical Clocks AB

www.hops.io
@hopshadoop

Page 2:

Logical Clocks AB*

• Seed funded
  - 12 FTEs

* Logical clocks are needed to build reliable distributed systems. There is no wall-clock time in distributed systems, so we use a logical clock to capture causality (happened-before) between processes.
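The footnote's idea of capturing happened-before without wall-clock time can be illustrated with a Lamport clock. The following is a minimal sketch, not code from Hops; the class name `LamportClock` and the process names `p` and `q` are chosen for illustration only.

```python
class LamportClock:
    """A Lamport logical clock: one integer counter per process."""

    def __init__(self):
        self.time = 0

    def tick(self):
        """Local event: advance the counter by one."""
        self.time += 1
        return self.time

    def send(self):
        """Sending is an event; the returned timestamp travels with the message."""
        return self.tick()

    def receive(self, msg_time):
        """Jump past the sender's timestamp so the send is ordered before the receive."""
        self.time = max(self.time, msg_time) + 1
        return self.time


# Two processes, p and q (hypothetical names).
p, q = LamportClock(), LamportClock()
p.tick()                    # p: local event, clock becomes 1
stamp = p.send()            # p: send event, clock becomes 2
q_time = q.receive(stamp)   # q: max(0, 2) + 1 = 3
```

Because `q_time` is always larger than `stamp`, any event that causally follows the message carries a larger timestamp than the send, regardless of what the machines' physical clocks say.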

Page 3:

Evolution of Hadoop

[Figure: Hadoop from 2009 to 2018, followed by "?". Tiny brain (NameNode, ResourceMgr), huge body (DataNodes).]

Page 4:

HopsFS: Next Generation HDFS*

Faster: 16x throughput

Bigger: 37x number of files

Scale Challenge Winner (2017)

*https://www.usenix.org/conference/fast17/technical-sessions/presentation/niazi

Page 5:

Page 6:

What about our Cloud-Native Future?

[Diagram labels: App1 Container ... AppN Container, Kube Master, Host, Object Stores: S3 / GCS / etc.]

Object stores, as they are today, are not the future!

Page 7:

The Problems with Object Stores*

• Object stores are inferior to hierarchical filesystems
  - False equivalence*

• Object stores can scale better, but at what cost?
  - Read-your-writes for existing objs on S3#
  - Write, then list
  - No hierarchical namespace properties
    • Quotas, permissions
  - Slow rename operations

*False equivalence: “unfairly equating a minor failing to a major failing.”
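Why renames are slow on an object store can be seen in a toy model. This is a sketch under stated assumptions: `FlatObjectStore` and `TreeNamespace` are hypothetical in-memory stand-ins, not S3 or HDFS APIs. The flat store treats "directories" as key prefixes, so renaming one means rewriting every key under it, while a hierarchical namespace relinks a single directory entry.

```python
class FlatObjectStore:
    """Flat key -> bytes map, like a bucket: 'directories' are only key prefixes."""

    def __init__(self):
        self.objects = {}

    def put(self, key, data):
        self.objects[key] = data

    def rename_prefix(self, old, new):
        """Rename a 'directory': every object under the prefix is rewritten."""
        moved = [k for k in self.objects if k.startswith(old)]
        for k in moved:
            self.objects[new + k[len(old):]] = self.objects.pop(k)
        return len(moved)  # number of objects touched


class TreeNamespace:
    """Hierarchical namespace: each directory is a dict of named children."""

    def __init__(self):
        self.root = {}

    def put(self, path, data):
        *dirs, name = path.strip("/").split("/")
        node = self.root
        for d in dirs:
            node = node.setdefault(d, {})
        node[name] = data

    def rename_dir(self, old, new):
        """Rename a top-level directory: one entry is relinked, children untouched."""
        self.root[new] = self.root.pop(old)
        return 1  # number of entries touched


flat, tree = FlatObjectStore(), TreeNamespace()
for i in range(1000):
    flat.put(f"logs/part-{i}", b"x")
    tree.put(f"/logs/part-{i}", b"x")

flat_cost = flat.rename_prefix("logs/", "archive/")  # touches all 1000 objects
tree_cost = tree.rename_dir("logs", "archive")       # touches one entry
```

In this model the flat rename costs grow with the number of objects under the prefix, and the operation is not atomic: a reader can observe a half-renamed directory.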

Page 8:

How Bad are Existing Object Stores?

Big companies are implementing their own eventually consistent metadata stores to mirror the (unobservable) metadata in S3/GCS:
• Netflix for AWS
  - s3mper
• Databricks DBFS
• EMRFS

http://techblog.netflix.com/2014/01/s3mper-consistency-in-cloud.html

Page 9:

HDFS: A POSIX-Style Distributed FS

Page 10:

Apache HDFS

Page 11:

HDFS Scalability Limitations

• Limited namespace
  - JVM heap (~300 GB)

• Limited concurrency
  - Single global lock (single writer, multiple readers)

[Diagram labels: HDFS Client, HDFS DataNode, NameNode]

Page 12:

NameNode in the Tragedy of the Commons

Single-Host NameNode

Page 13:

HopsFS

• Scale-out metadata
  - Metadata in an in-memory distributed database
  - Multiple stateless NameNodes

• Remove the global metadata lock
  - Multiple concurrent read and write operations
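The effect of removing a global metadata lock can be sketched with a toy model. This is an illustration only, not HopsFS code: `GlobalLockNamespace` and `PerInodeLockNamespace` are hypothetical classes, the global lock is simplified to a plain exclusive lock, and the per-path locks stand in for whatever finer-grained locking the metadata database provides.

```python
import threading
from collections import defaultdict


class GlobalLockNamespace:
    """One lock for the whole namespace: a writer blocks every other write."""

    def __init__(self):
        self._lock = threading.Lock()

    def try_write(self, path):
        # Non-blocking attempt, so the demo below stays deterministic.
        return self._lock.acquire(blocking=False)

    def done(self, path):
        self._lock.release()


class PerInodeLockNamespace:
    """One lock per path: writes to different paths do not contend."""

    def __init__(self):
        self._locks = defaultdict(threading.Lock)

    def try_write(self, path):
        return self._locks[path].acquire(blocking=False)

    def done(self, path):
        self._locks[path].release()


coarse, fine = GlobalLockNamespace(), PerInodeLockNamespace()

coarse_first = coarse.try_write("/a")    # succeeds
coarse_second = coarse.try_write("/b")   # blocked by the write to /a

fine_first = fine.try_write("/a")        # succeeds
fine_second = fine.try_write("/b")       # also succeeds: different path
fine_conflict = fine.try_write("/a")     # blocked: same path is still held
```

With the global lock, two unrelated writes serialize; with per-path locks, only operations on the same path wait for each other.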

Page 14:

HopsFS Architecture

Page 15:

HopsFS Evaluation

• On premise
  - Dual Intel® Xeon® E5-2620 v3 @ 2.40 GHz
  - 256 GB RAM, 4 TB disks

• 10 GbE
  - 0.1 ms ping latency between nodes

*https://www.usenix.org/conference/fast17/technical-sessions/presentation/niazi

Page 16:

Evaluation: Spotify Workload

Page 17:

HopsFS Higher Throughput with Same Hardware

HopsFS outperforms with equivalent hardware: HA-HDFS with five servers
● 1 active NameNode
● 1 standby NameNode
● 3 servers
  ○ Journal nodes
  ○ ZooKeeper nodes

Page 18:

Evaluation: Spotify Workload (contd.)

Page 19:

Evaluation: Spotify Workload (contd.)

Page 20:

Evaluation: Spotify Workload (contd.)

Page 21:

Evaluation: Spotify Workload (contd.)

16X the performance of HDFS.

Further scaling possible with more hardware

Page 22:

Write-Intensive Workloads

Workload                                HopsFS ops/sec   HDFS ops/sec   Scaling Factor
Synthetic Workload (5.0% File Writes)   1.19 M           53.6 K         22
Synthetic Workload (10% File Writes)    1.04 M           35.2 K         30
Synthetic Workload (20% File Writes)    0.748 M          19.9 K         37

Scalability of HopsFS and HDFS for write-intensive workloads
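The scaling factors follow from dividing the HopsFS throughput by the HDFS throughput. A short check using only the figures quoted on the slide (the variable names are illustrative):

```python
# (workload, HopsFS ops/sec, HDFS ops/sec, scaling factor reported on the slide)
rows = [
    ("5% file writes", 1.19e6, 53.6e3, 22),
    ("10% file writes", 1.04e6, 35.2e3, 30),
    ("20% file writes", 0.748e6, 19.9e3, 37),
]

# Ratio of HopsFS to HDFS throughput for each workload.
ratios = [hops / hdfs for _, hops, hdfs, _ in rows]

for (label, _, _, reported), ratio in zip(rows, ratios):
    print(f"{label}: {ratio:.1f}x computed, {reported}x reported")
```

The computed ratios come out at roughly 22.2, 29.5 and 37.6. They agree with the reported factors to within one unit; the small differences presumably come from rounding in the throughput figures shown on the slide.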


Page 24:

Metadata Scalability

HDFS metadata: 200 GB → max 500 million files/directories

HopsFS metadata: 24 TB → max 17 billion files/directories

37 times more files than HDFS

Page 25:

HopsFS Metadata Capacity

Assuming 256 MB block size, 100 GB JVM heap for Apache Hadoop


Page 29:

Operational Latency

File System Clients

No. of Clients   HopsFS Latency   HDFS Latency
50               3.0              3.1
1500             3.7              15.5
6500             6.8              67.4

For 6500 clients, HopsFS has 10 times lower latency than HDFS

Page 30:

Small Files in Hadoop

• At both Spotify and Yahoo, 20% of the files are <= 4 KB

Page 31:

Small Files in HopsFS*

• In HopsFS, we can store small files co-located with the metadata in MySQL Cluster as on-disk data.

inode_id    varbinary (on-disk column)
32123432    /tmp/smallFile.txt

*Niazi et al., Size Matters: Improving the Performance of Small Files in HDFS, poster at EuroSys 2017
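Storing small files next to their metadata can be sketched as follows. This is a minimal illustration, not the HopsFS implementation: SQLite stands in for MySQL Cluster, the table names, the `block_store` dict (a stand-in for DataNode storage), and the 4 KB threshold `SMALL_FILE_LIMIT` are all assumptions made for the example.

```python
import sqlite3

SMALL_FILE_LIMIT = 4 * 1024  # assumed threshold in bytes, for illustration only

db = sqlite3.connect(":memory:")  # SQLite stands in for the metadata database
db.execute("CREATE TABLE inodes (inode_id INTEGER PRIMARY KEY, path TEXT UNIQUE, size INTEGER)")
db.execute("CREATE TABLE small_files (inode_id INTEGER PRIMARY KEY REFERENCES inodes(inode_id), data BLOB)")
block_store = {}  # hypothetical stand-in for block storage on DataNodes


def write_file(inode_id, path, data):
    """Store small files inline with the metadata, large files in the block store."""
    with db:  # one transaction: the inode row and the inline data commit together
        db.execute("INSERT INTO inodes VALUES (?, ?, ?)", (inode_id, path, len(data)))
        if len(data) <= SMALL_FILE_LIMIT:
            db.execute("INSERT INTO small_files VALUES (?, ?)", (inode_id, data))
        else:
            block_store[inode_id] = data


def read_file(path):
    """A small file is returned by the same query that resolves its metadata."""
    inode_id, inline = db.execute(
        "SELECT i.inode_id, s.data FROM inodes i "
        "LEFT JOIN small_files s ON s.inode_id = i.inode_id WHERE i.path = ?",
        (path,),
    ).fetchone()
    return inline if inline is not None else block_store[inode_id]


write_file(32123432, "/tmp/smallFile.txt", b"hello")
write_file(2, "/tmp/big.bin", b"x" * 8192)
small = read_file("/tmp/smallFile.txt")
large = read_file("/tmp/big.bin")
```

The point of the design is that reading a small file needs no extra round trip to a storage node: the bytes come back with the metadata lookup.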

Page 32:

HopsFS Small Files Performance

30 NameNodes/DataNodes and 6 NDB nodes were used. Small file size was 4 KB. HopsFS files were stored on Intel 750 Series SSDs.

Page 33:

Ongoing Work: Multi-Data-Center HopsFS

[Diagram: two data centers, Hops-eu-west1 and Hops-eu-west2, each with NameNodes (NN), DataNodes (DN), and NDB, plus a client. Labels: Synchronous Replication of Blocks; Asynchronous Multi-Master Replication of Metadata; Network Partition Identification Service.]

Page 34:

What do I do with all this Metadata?

Page 35:

Extending Metadata in Hops

Metadata API
public void attachMetadata(Json obj, String pathToFileorDir)
public Json getMetadata(String pathToFileorDir)
public void removeMetadata(Json obj, String pathToFileorDir)

DIY Extended Metadata
• Add your own tables in the database
  - Foreign keys + transactions ensure metadata consistency
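How foreign keys and transactions keep user-defined metadata consistent can be shown with a small sketch. SQLite stands in for the database here, and the `inodes` and `file_tags` tables are hypothetical; the actual Hops schema is not given in the slides.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
db.execute("CREATE TABLE inodes (inode_id INTEGER PRIMARY KEY, path TEXT UNIQUE)")
db.execute(
    "CREATE TABLE file_tags ("
    " inode_id INTEGER NOT NULL REFERENCES inodes(inode_id) ON DELETE CASCADE,"
    " tag TEXT NOT NULL)"
)

# The file and its extended metadata are written in one transaction.
with db:
    db.execute("INSERT INTO inodes VALUES (1, '/Projs/My/Data')")
    db.execute("INSERT INTO file_tags VALUES (1, 'reviewed')")

# Metadata that points at a non-existent file is rejected by the foreign key.
try:
    with db:
        db.execute("INSERT INTO file_tags VALUES (99, 'orphan')")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

# Deleting the file removes its metadata in the same transaction.
with db:
    db.execute("DELETE FROM inodes WHERE inode_id = 1")
remaining = db.execute("SELECT COUNT(*) FROM file_tags").fetchone()[0]
```

No orphaned metadata can exist before or after the delete, which is the consistency property the slide attributes to foreign keys plus transactions.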

Page 36:

Strong Eventually Consistent Metadata

[Diagram labels: Database, Epipe, Changelog, Kafka, Elasticsearch, Hive Metastore]

Free-text search for files/dirs in the HopsFS namespace

Page 37:

Re-imagine Hadoop with Big Metadata

Page 38:

Hopsworks = Big Data Science Platform

[Architecture diagram labels:]
• Java EE Application (AngularJS front end), REST API
• Projects, Datasets, Users
• Jupyter, Zeppelin
• Jobs, Kibana, Grafana
• Spark, Flink, TensorFlow
• HopsFS / YARN
• MySQL Cluster, Hive, InfluxDB, Elasticsearch, Kafka

Page 39:

Projects in Hopsworks

[Diagram labels: Project1 ... ProjectN, Data Engineering, Data Science, DataLake, GPUs, Compute, Kafka, Elasticsearch, Metadata]

Only Hadoop platform with support for GPUs (Feb 2018)

Page 40:

Projects for Software-as-a-Service

A project is a grouping of users and data

[Diagram labels: Proj-42, Proj-X, Proj-All, Shared Topic, Topic, /Projs/My/Data, CompanyDB]

Ismail et al., Hopsworks: Improving User Experience and Development on Hadoop with Scalable, Strongly Consistent Metadata, ICDCS 2017

Page 41:

Project Roles

• Data Owner privileges
  - Import/export data
  - Manage membership
  - Share datasets, topics

• Data Scientist privileges
  - Write and run code

We delegate administration of privileges to users

Page 42:

Multi-Tenant Project-Based Platform

[Diagram labels: OLTP System, Logs, Extern Systems, Storage, Analysis, Data Lake, Hopsworks, Assets, Trading, FX, DataSci, IT, Reporting]

Page 43:

Hopsworks Projects in Practice

[Diagram labels: Engineering, Kafka Topic, FX Project, FX Topic, FX DB, FX Data Stream, Shared Interactive Analytics, FX team]

Page 44:

Hopsworks: Hadoop for Humans

www.hops.io

Page 45:

Manage Projects like GitHub

Page 46:

Share like in Dropbox

Share any data source/sink: HDFS datasets, Kafka topics, etc.

Page 47:

Modern Data Parallel Platforms

Page 48:

Workflow/Jobs and Notebook Support

Page 49:

Custom Python Environments with Conda

Python libraries are usable by Spark/TensorFlow

Page 50:

How is Hopsworks Used?

Page 51:

ETL Workloads

Pipelines transform raw data to structured data

[Diagram labels: HopsFS, Hopsworks Jobs, trigger, Parquet, Hive, Elastic, DrElephant, performance]

Page 52:

Business Intelligence Workloads

[Diagram labels: Parquet, Hive, Elastic; Jupyter/Zeppelin or Jobs; Zeppelin/PixieDust; Kibana reports; Tableau reports]

Page 53:

Streaming Analytics in Hopsworks

[Diagram labels: Hopsworks, Ingress, Kafka, TF-Serving, InfluxDB, …, MySQL, Parquet, Hive, Elastic; outputs: Notifications, Reporting; Archival, ETL, Reporting]

Page 54:

DeepLearning-as-a-Service

[Diagram labels: Experiments, FeatureStore, TensorFlow Serving, TensorBoard, Kafka, Hive, Hops]

Page 55:

TensorFlow/Hops on 10 1080Ti GPUs

https://www.oreilly.com/ideas/distributed-tensorflow

Page 56:

Data Wrangling on Hopsworks

• Google Facets/Dive
• Visualize data distributions
  - Quick understanding of underlying data
• Min/max/mean/median values for features
• Missing values in columns
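The per-feature statistics listed above are simple to compute by hand, which helps when reading what a tool such as Facets reports. A minimal sketch using only the Python standard library; the `summarize` helper and the sample rows are invented for illustration and are not part of Hopsworks or Facets.

```python
import statistics


def summarize(rows, column):
    """Min/max/mean/median of a column, plus a count of missing (None) values."""
    values = [r[column] for r in rows if r.get(column) is not None]
    return {
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "missing": len(rows) - len(values),
    }


# Invented sample data: one missing value in each column.
rows = [
    {"age": 31, "income": 52000},
    {"age": 45, "income": None},
    {"age": None, "income": 61000},
    {"age": 27, "income": 48000},
]

age = summarize(rows, "age")
income = summarize(rows, "income")
```

A missing-value count that is high relative to the number of rows, or a mean far from the median, is usually the first hint that a feature needs cleaning before training.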

Page 57:

Making Spark/TensorFlow Easier to Program

Page 58:

HopsUtil simplifies Secure Spark/Kafka

Without HopsUtil:

// brokerList and restApp are assumed to be defined elsewhere.
Properties props = new Properties();
props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, brokerList);
props.put(SCHEMA_REGISTRY_URL, restApp.restConnect);
props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, org.apache.kafka.common.serialization.StringSerializer.class);
props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, io.confluent.kafka.serializers.KafkaAvroSerializer.class);
props.put(ProducerConfig.ACKS_CONFIG, "1");
// SSL credentials for the secure cluster
props.put("ssl.keystore.location", "/var/ssl/kafka.client.keystore.jks");
props.put("ssl.keystore.password", "test1234");
props.put("ssl.key.password", "test1234");
// Build an Avro record that matches the topic's schema
String userSchema = "{\"namespace\": \"example.avro\", \"type\": \"record\", \"name\": \"User\"," +
    "\"fields\": [{\"name\": \"name\", \"type\": \"string\"}]}";
Schema.Parser parser = new Schema.Parser();
Schema schema = parser.parse(userSchema);
GenericRecord avroRecord = new GenericData.Record(schema);
avroRecord.put("name", "testUser");
// The value type is Object because the Avro serializer handles the record
Producer<String, Object> producer = new KafkaProducer<>(props);
ProducerRecord<String, Object> message = new ProducerRecord<>("topicName", avroRecord);
producer.send(message);

With HopsUtil:

SparkProducer producer = HopsUtil.getSparkProducer();

https://github.com/hopshadoop/hops-kafka-examples

Page 59:

Parallel Experiments on Spark/TensorFlow

def model_fn(learning_rate, dropout):
    ...

from hops import tflauncher
args_dict = {'learning_rate': [0.001, 0.005, 0.01],
             'dropout': [0.5, 0.6, 0.7]}
tflauncher.launch(spark, model_fn, args_dict)

Launches 3 executors with 3 different hyperparameter settings. Each executor can have 1-N GPUs.

http://hops.readthedocs.io/en/latest/user_guide/hopsworks/tensorflow.html

#EUai8
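Since two lists of three values yield 3 executors rather than 9, the values appear to be paired by position rather than combined as a grid. The sketch below shows that pairing under this assumption; `expand` is a hypothetical helper written for illustration and is not the `tflauncher` implementation.

```python
def expand(args_dict):
    """Pair hyperparameter values by position: the i-th values form the i-th setting."""
    lengths = {len(values) for values in args_dict.values()}
    if len(lengths) != 1:
        raise ValueError("every hyperparameter needs the same number of values")
    names = list(args_dict)
    return [dict(zip(names, combo)) for combo in zip(*args_dict.values())]


args_dict = {'learning_rate': [0.001, 0.005, 0.01],
             'dropout': [0.5, 0.6, 0.7]}

settings = expand(args_dict)  # one dict of keyword arguments per executor
```

Each dict in `settings` could then be passed to `model_fn` as keyword arguments, for example `model_fn(**settings[0])`. A full grid search over the same lists would instead produce 9 settings.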

Page 60:

Hadoop-as-a-Service in Sweden

• Hops
  - Spark/Flink/Kafka/TensorFlow/Hadoop-as-a-service
  - www.hops.site

• RISE SICS ICE
  - 250 kW datacenter, ~1000 servers
  - Research and test environment

https://www.sics.se/projects/sics-ice-data-center-in-lulea

Page 61:

Summary

• Europe’s only Hadoop distribution: Hops Hadoop
  - Fully open source
  - Larger/faster Hadoop clusters

• Hopsworks is a new data platform built on HopsFS with first-class support for streaming and ML
  - Spark or Flink
  - TensorFlow
  - Support services for streaming analytics and ML

Page 62:

The Team

Active:

Jim Dowling, Seif Haridi, Tor Björn Minde, Gautier Berthou, Salman Niazi, Mahmoud Ismail, Theofilos Kakantousis, Ermias Gebremeskel, Antonios Kouzoupis, Alex Ormenisan, Fabio Buso, Robin Andersson, August Bonds.

Alumni:

Vasileios Giannokostas, Johan Svedlund Nordström, Rizvi Hasan, Paul Mälzer, Bram Leenders, Juan Roca, Misganu Dessalegn, K “Sri” Srijeyanthan, Jude D’Souza, Alberto Lorente, Andre Moré, Ali Gholami, Davis Jaunzems, Stig Viaene, Hooman Peiro, Evangelos Savvidis, Steffen Grohsschmiedt, Qi Qi, Gayana Chandrasekara, Nikolaos Stanogias, Daniel Bali, Ioannis Kerkinos, Peter Buechler, Pushparaj Motamari, Hamid Afzali, Wasif Malik, Lalith Suresh, Mariano Valles, Ying Lieu, Fanti Machmount Al Samisti, Braulio Grana, Adam Alpire, Zahin Azher Rashid, ArunaKumari Yedurupaka, Tobias Johansson, Roberto Bampi, Filotas Siskos.

www.hops.io
@hopshadoop

Page 63:

Thank You.
Follow us: @hopshadoop
Star us: http://github.com/hopshadoop/hopsworks
Join us: http://www.hops.io