October 2016 Predictive Analytics Big Data & Artificial Intelligence
27

Predictive Analytics - Big Data & Artificial Intelligence

Apr 16, 2017

Technology

Manish Jain
Transcript
Page 1: Predictive Analytics - Big Data & Artificial Intelligence

October 2016

Predictive Analytics - Big Data & Artificial Intelligence

Page 2: Predictive Analytics - Big Data & Artificial Intelligence

Agenda

Demystify the following buzzwords:

• Artificial Intelligence (AI)
• Big Data
• Machine Learning
• Deep Learning
• Neural Networks
• NLP (Natural Language Processing)
• Image Recognition

Page 3: Predictive Analytics - Big Data & Artificial Intelligence

Ultimate Goal: Predictive Analytics

Predict what users will want to buy.

A consumer searches for a TV and, based on previous customers’ data, is shown a product that has a high probability of being bought as well.
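
One simple way to picture the TV example is to estimate, from past purchases, how often each other product was bought together with the searched item. The sketch below is only an illustration: the basket data, the product names, and the function `co_purchase_probabilities` are all hypothetical, and a production recommender would likely use a far richer model.

```python
from collections import Counter

# Hypothetical purchase histories: each set is one previous customer's basket.
baskets = [
    {"tv", "hdmi_cable"},
    {"tv", "soundbar", "hdmi_cable"},
    {"tv", "wall_mount"},
    {"laptop", "mouse"},
    {"tv", "hdmi_cable"},
]

def co_purchase_probabilities(baskets, searched_item):
    """Estimate P(other item bought | searched_item bought) from past baskets."""
    with_item = [b for b in baskets if searched_item in b]
    if not with_item:
        return {}
    # Count how often each other item appears alongside the searched item.
    counts = Counter(
        item for b in with_item for item in b if item != searched_item
    )
    return {item: n / len(with_item) for item, n in counts.items()}

probs = co_purchase_probabilities(baskets, "tv")
# Recommend the product with the highest estimated co-purchase probability.
best = max(probs, key=probs.get)
print(best, probs[best])
```

With this toy data, the product shown to someone searching for a TV would be the one most often found in the same basket as a TV.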

Page 4: Predictive Analytics - Big Data & Artificial Intelligence

Evolution of Data Analytics

• 1990s: Excel. What Happened?
• 2000s: Business Intelligence (BI) Dashboards. What’s Happening?
• 2015 and beyond: Actionable Insights. What Will Happen?

Page 5: Predictive Analytics - Big Data & Artificial Intelligence

The Process

1. Data Generated: structured and unstructured (e.g. video) data.
2. Data Stored: data is stored in databases and servers.
3. Data Processing: process the data using CPUs/GPUs and AI algorithms to detect patterns.
4. Actionable Insights: predictive signals are generated.

CPU: Central Processing Unit. GPU: Graphics Processing Unit.

The diagram groups these steps under two labels: Big Data and Artificial Intelligence.

Page 6: Predictive Analytics - Big Data & Artificial Intelligence

How Did We Get Here?

Databases (the 80s)
• Relational databases
• Gigabytes in size
• Low latency

Data Warehousing (the 90s)
• Terabytes in size
• Custom hardware

Page 7: Predictive Analytics - Big Data & Artificial Intelligence

Today, it’s Big Data

Page 8: Predictive Analytics - Big Data & Artificial Intelligence

Artificial Intelligence (AI)

Page 9: Predictive Analytics - Big Data & Artificial Intelligence

Artificial Intelligence (AI)

Page 10: Predictive Analytics - Big Data & Artificial Intelligence

When To Use Machine Learning

1. A pattern exists.
2. We cannot pin down the pattern mathematically.
3. We have data, and hopefully lots of data.

Page 11: Predictive Analytics - Big Data & Artificial Intelligence

Types of Machine Learning

Page 12: Predictive Analytics - Big Data & Artificial Intelligence

Supervised Learning

We know what we are trying to predict. We use examples that both we and the model know the answers to in order to “train” our model. It can then generate predictions for examples we don’t know the answer to.

Example: Predict the price of a house based on the size of the house.

[Figure: scatter plot of Price versus Square Feet]
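
As a rough illustration of the house-price example, the sketch below fits a straight line to a handful of labeled examples with ordinary least squares and then predicts a price for an unseen size. The data points and the helper name `fit_line` are made up for illustration; the slides do not say which model the author had in mind.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one feature: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical training examples where the "answer" (price) is known.
square_feet = [1000, 1500, 2000, 2500]
prices = [200_000, 300_000, 400_000, 500_000]

# "Training": learn the line from labeled examples.
slope, intercept = fit_line(square_feet, prices)

# Prediction for a house whose price we do not know.
predicted = slope * 1800 + intercept
print(round(predicted))
```

The toy data lies exactly on a line, so the fit is perfect here; real listings would scatter around the fitted line.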

Page 13: Predictive Analytics - Big Data & Artificial Intelligence

Unsupervised Learning

We don’t know what we are trying to predict. We are trying to identify naturally occurring patterns in the data which may be informative.

Example: Try to identify “clusters” of customers based on the data we have on them.

[Figure: scatter plot of unlabeled points (O) on X and Y axes, forming clusters]
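
One common way to find such clusters is k-means, sketched below in plain Python. The slides do not name a specific algorithm, so k-means is a swapped-in illustration; the customer data (visits per month, average spend) and the starting centroids are invented for the example.

```python
import math

def kmeans(points, centroids, iterations=10):
    """Plain k-means: assign points to the nearest centroid, then re-average."""
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(
                range(len(centroids)),
                key=lambda i: math.dist(p, centroids[i]),
            )
            clusters[nearest].append(p)
        # Move each centroid to the mean of its cluster (keep it if empty).
        centroids = [
            tuple(sum(c) / len(cluster) for c in zip(*cluster))
            if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
    return centroids, clusters

# Hypothetical customers: (visits per month, average spend). No labels given.
customers = [(1, 20), (2, 25), (1, 22), (10, 200), (11, 210), (12, 190)]

# Assumption: we ask for two clusters and seed them with two of the points.
centroids, clusters = kmeans(customers, centroids=[(1, 20), (10, 200)])
print(centroids)
print(clusters)
```

Nothing tells the algorithm which customers are "occasional" and which are "frequent"; the two groups emerge from the data alone.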

Page 14: Predictive Analytics - Big Data & Artificial Intelligence

What is Deep Learning?

• Deep Learning and Neural Networks are often used synonymously; deep learning refers to neural networks with many layers.
• It’s a branch of machine learning based on a set of algorithms that attempt to model high-level abstractions in data by using a deep graph with multiple processing layers, composed of multiple linear and non-linear transformations.

[Figure: What we see vs. what the computer “sees”]
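
To make "multiple linear and non-linear transformations" a little more concrete, here is a minimal sketch of a forward pass through two layers. It treats a tiny 2x2 image as the grid of numbers the computer "sees". The weights are arbitrary, untrained values chosen only for illustration, and ReLU is just one common choice of non-linearity.

```python
def linear(inputs, weights, biases):
    """Linear transformation: one weighted sum plus bias per output unit."""
    return [
        sum(w * x for w, x in zip(row, inputs)) + b
        for row, b in zip(weights, biases)
    ]

def relu(values):
    """Non-linear transformation: negative values are clipped to zero."""
    return [max(0.0, v) for v in values]

# What the computer "sees": a 2x2 grayscale image as numbers (0 to 255).
image = [[0, 255],
         [255, 0]]
pixels = [v / 255 for row in image for v in row]  # flatten and scale to 0..1

# Hypothetical, untrained weights for two processing layers.
w1 = [[1.0, -1.0, 0.5, 0.0],
      [-1.0, 1.0, 1.0, -1.0]]
b1 = [0.0, 0.0]
w2 = [[1.0, 2.0]]
b2 = [-0.5]

hidden = relu(linear(pixels, w1, b1))  # layer 1: linear, then non-linear
output = linear(hidden, w2, b2)        # layer 2: linear
print(hidden, output)
```

A deep network stacks many such layers and learns the weights from data rather than having them written by hand.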

Page 15: Predictive Analytics - Big Data & Artificial Intelligence

Tools of The Trade

• Apache SystemML
• Google Cloud Machine Learning

Page 16: Predictive Analytics - Big Data & Artificial Intelligence

[email protected]

Questions?

version: draft

Page 17: Predictive Analytics - Big Data & Artificial Intelligence

Appendix


Page 18: Predictive Analytics - Big Data & Artificial Intelligence

AI Researchers

• Geoffrey Hinton: University of Toronto, Google
• Yoshua Bengio: University of Montreal
• Yann LeCun: New York University, Facebook
• Andrew Ng: Stanford University, Baidu

Page 19: Predictive Analytics - Big Data & Artificial Intelligence

CPU vs GPU Performance

Page 20: Predictive Analytics - Big Data & Artificial Intelligence

MapReduce

Page 21: Predictive Analytics - Big Data & Artificial Intelligence

The Name… Hadoop

Named after the yellow toy elephant of Doug Cutting’s son.

In 2006, while working at Yahoo, Doug came up with the Hadoop framework. In 2008, it became a top-level project of the Apache Software Foundation, hence the official name Apache Hadoop.

Page 22: Predictive Analytics - Big Data & Artificial Intelligence

Hadoop to the Rescue

“an open source framework written in Java for storing and processing massive amounts of data in a distributed manner”

2 Key Components of the Framework:

1. Storage: Hadoop Distributed File System (HDFS). Scalable file system that distributes and stores data across many machines in a cluster.
2. Analysis: MapReduce, a framework for distributed processing.

Page 23: Predictive Analytics - Big Data & Artificial Intelligence

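Hadoop Architecture

Hadoop can run on cheap commodity hardware, on premises or in the cloud.

HDFS
Stores files in large blocks (64 MB by default in Hadoop 1.x) across multiple machines for fault tolerance. By default, each block is stored on 3 separate machines.

MapReduce
Breaks large data processing problems into multiple steps, namely Mappers and Reducers, that can be worked on in parallel on multiple machines. In Hadoop 1.x, both map and reduce tasks run on TaskTrackers, which usually share machines with the HDFS DataNodes so that mappers can read their data locally.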
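
The block and replication arithmetic can be sketched in a few lines. This is a toy model, not HDFS code: the 100 MB file size and the node names are example values, and the round-robin placement in `place_replicas` is an assumption for illustration (real HDFS uses a rack-aware placement policy).

```python
def split_into_blocks(file_size_mb, block_size_mb=64):
    """Split a file into fixed-size blocks; the last block may be smaller."""
    blocks = []
    remaining = file_size_mb
    while remaining > 0:
        size = min(block_size_mb, remaining)
        blocks.append(size)
        remaining -= size
    return blocks

def place_replicas(num_blocks, machines, replication=3):
    """Toy round-robin placement: each block goes to `replication` machines."""
    if replication > len(machines):
        raise ValueError("need at least as many machines as replicas")
    return {
        b: [machines[(b + r) % len(machines)] for r in range(replication)]
        for b in range(num_blocks)
    }

blocks = split_into_blocks(100)          # example: a 100 MB file
machines = ["node1", "node2", "node3", "node4"]  # hypothetical cluster
placement = place_replicas(len(blocks), machines)
raw_storage_mb = sum(blocks) * 3         # every block is stored three times

print(blocks, placement, raw_storage_mb)
```

Losing any single machine in this sketch still leaves at least two copies of every block, which is the fault tolerance the slide refers to.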

Page 24: Predictive Analytics - Big Data & Artificial Intelligence

MapReduce: Store Sales Data (100 MB)

Mappers (NameNode 1)
• DataNode 1 (64 MB): LA, NYC
• DataNode 2 (36 MB): LA, NYC

Shuffle and Sort

Reducers (JobTracker)
• TaskTracker 1: LA, LA
• TaskTracker 2: NYC, NYC
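
The flow in this diagram can be imitated on a single machine to show what each phase does. The sketch below is plain Python, not the Hadoop API: the sales records, their "city,amount" format, and the function names are assumptions for illustration, and real Hadoop would run the map and reduce calls on different machines.

```python
from collections import defaultdict

# Hypothetical store sales records, split across two blocks as in HDFS.
blocks = [
    ["LA,100", "NYC,250"],
    ["LA,50", "NYC,75"],
]

def map_block(lines):
    """Map: turn each raw line into a (city, amount) key-value pair."""
    pairs = []
    for line in lines:
        city, amount = line.split(",")
        pairs.append((city, float(amount)))
    return pairs

def shuffle_and_sort(mapped_outputs):
    """Shuffle and sort: group values by key, with keys in sorted order."""
    grouped = defaultdict(list)
    for output in mapped_outputs:
        for key, value in output:
            grouped[key].append(value)
    return dict(sorted(grouped.items()))

def reduce_city(city, amounts):
    """Reduce: combine all values for one key into a single result."""
    return city, sum(amounts)

mapped = [map_block(b) for b in blocks]   # one mapper per block
grouped = shuffle_and_sort(mapped)        # all LA values together, all NYC together
result = dict(reduce_city(c, a) for c, a in grouped.items())
print(result)
```

Each mapper only sees its own block, and each reducer only sees one city, which is what lets the work be spread over many machines.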

Page 25: Predictive Analytics - Big Data & Artificial Intelligence

MapReduce

Map → Shuffle & Sort → Reduce → Result

Page 26: Predictive Analytics - Big Data & Artificial Intelligence

Hadoop 1.0 vs 2.0

Page 27: Predictive Analytics - Big Data & Artificial Intelligence

The Future…