This slide deck gives an overview of Apache Spark and its machine learning library, MLlib.
Presentation for the Multilingual Web workshop, May 8, 2014
This is my presentation at Monitorama PDX in Portland on May 5, 2014: Simple math to get some signal out of your noisy sea of data. You’ve instrumented your system and application…
Alpine Data Labs presents a deep dive into our implementation of Multinomial Logistic Regression with Apache Spark. Machine Learning Engineer DB Tsai takes us through the…
Slides from an ACRL DCIG webinar from 30 April 2014 discussing basic data management practices in file organization and naming, documentation, storage and backup, and making…
Molecules of Knowledge (MoK) is a coordination model supporting self-organisation of knowledge in Knowledge Intensive Environments (KIE). Usual approaches to knowledge management…
Presentation of the content of the O'Reilly book, Agile Data Science. Applied data science on Hadoop.