Top Banner
Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential
17

Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Dec 13, 2015

Download

Documents

Nigel Gibbs
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Anton Slutsky, Lead Data Scientist, EPAM Systems

Hadoop + Mahout

Confidential

Page 2: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 2

Agenda

Page 3: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 3

Machine Learning vs. Statistics

Page 4: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 4

Types of Machine Learning

Page 5: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 5

Machine Learning Applications

Page 6: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 6

Machine Learning and Data

Page 7: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 7

Obligatory Big Data Slide

Page 8: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 8

Hadoop

Page 9: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 9

Apache Mahout

Page 10: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 10

Why Hadoop + Mahout?

Page 11: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 11

Machine Learning Applications

Page 12: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 12

Machine Learning Applications

Page 13: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 13

Hadoop + Mahout Algorithm

Page 14: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 14

Get data into Hadoop

Page 15: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 15

Convert data into Mahout format

Page 16: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 16

Mahout format – Sequence File

Page 17: Anton Slutsky, Lead Data Scientist, EPAM Systems Hadoop + Mahout Confidential.

Confidential 17

Learn model from Data