Documents tagged
Data & Analytics Introduction to Apache Spark and MLlib

This slide deck contains an overview of Apache Spark and its machine learning library, MLlib.

Data & Analytics Deploying and managing SolrCloud in the cloud using the Solr Scale Toolkit

SolrCloud is a set of features in Apache Solr that enable elastic scaling of search indexes using sharding and replication. In this presentation, Tim Potter will demonstrate…

Education Datascience Introduction WebSci Summer School 2014

http://www.summerschool.websci.net/ WebScience Summer School Southampton Data Science 2014

Engineering Scaling Big Data with Hadoop and Mesos

As a company starts dealing with large amounts of data, operations engineers are challenged with managing the influx of information while ensuring the resilience of data.…

Engineering Cloud schedulers and Scheduling in Hadoop

This presentation describes some of the features of different cloud schedulers.

Internet Vert.x clustering on Docker, CoreOS and ETCD

This talk was given at the Vert.x Meetup Amsterdam on 30-07-2014. It covers how to get a Vert.x cluster running in Docker containers on CoreOS without any…