Anton Slutsky, Lead Data Scientist, EPAM Systems
Hadoop + Mahout
Confidential
Confidential 2
Agenda
Confidential 3
Machine Learning vs. Statistics
Confidential 4
Types of Machine Learning
Confidential 5
Machine Learning Applications
Confidential 6
Machine Learning and Data
Confidential 7
Obligatory Big Data Slide
Confidential 8
Hadoop
Confidential 9
Apache Mahout
Confidential 10
Why Hadoop + Mahout?
Confidential 11
Machine Learning Applications
Confidential 12
Machine Learning Applications
Confidential 13
Hadoop + Mahout Algorithm
Confidential 14
Get data into Hadoop
Confidential 15
Convert data into Mahout format
Confidential 16
Mahout format – Sequence File
Confidential 17
Learn model from Data