Introduction to Apache Spark and MLlib
Data & Analytics

This slide deck gives an overview of Apache Spark and its machine learning library, MLlib.

Simple Math for Anomaly Detection - Toufic Boubez - Metafor Software - Monitorama PDX 2014-05-05
Data & Analytics

This is my presentation at Monitorama PDX in Portland on May 5, 2014: simple math to get some signal out of your noisy sea of data. You’ve instrumented your system and application…

Alpine Spark Implementation - Technical
Data & Analytics

Alpine Data Labs presents a deep dive into our implementation of Multinomial Logistic Regression with Apache Spark. Machine Learning Engineer DB Tsai takes us through the…

Practical Data Management - ACRL DCIG Webinar
Data & Analytics

Slides from an ACRL DCIG webinar from 30 April 2014 discussing basic data management practices in file organization and naming, documentation, storage and backup, and making…

Molecules of Knowledge: Self-Organisation in Knowledge-Intensive Environments
Data & Analytics

Molecules of Knowledge (MoK) is a coordination model supporting self-organisation of knowledge in Knowledge-Intensive Environments (KIE). Usual approaches to knowledge management…

Agile Data Science: Building Hadoop Analytics Applications
Data & Analytics

A presentation of the content of the O'Reilly book Agile Data Science: applied data science on Hadoop.