Top Banner
Bootstrapping Machine Learning Louis Dorard (@louisdorard)
44
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Bootstrapping Machine Learning

BootstrappingMachine Learning

Louis Dorard (@louisdorard)

Page 2: Bootstrapping Machine Learning
Page 3: Bootstrapping Machine Learning
Page 4: Bootstrapping Machine Learning
Page 5: Bootstrapping Machine Learning

–Waqar Hasan, Apigee Insights

“Predictive is the ‘killer app’ for big data.”

Page 6: Bootstrapping Machine Learning

–Mike Gualtieri, Principal Analyst at Forrester

“Predictive apps are the next big thing

in app development.”

Page 7: Bootstrapping Machine Learning

Machine Learning

Page 8: Bootstrapping Machine Learning

Data

Page 9: Bootstrapping Machine Learning

BUT

Page 10: Bootstrapping Machine Learning

–McKinsey & Co.

“A significant constraint on realizing value from big data will be a shortage of talent, particularly of people with deep expertise in statistics

and machine learning.”

Page 11: Bootstrapping Machine Learning

What the @#?~% is ML?

Page 12: Bootstrapping Machine Learning
Page 13: Bootstrapping Machine Learning

“How much is this house worth? — X $” -> Regression

Page 14: Bootstrapping Machine Learning

Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)

3 1 860 1950 house 565,000

3 1 1012 1951 house

2 1.5 968 1976 townhouse 447,000

4 1315 1950 house 648,000

3 2 1599 1964 house

3 2 987 1951 townhouse 790,0001 1 530 2007 condo 122,0004 2 1574 1964 house 835,000

4 2001 house 855,000

3 2.5 1472 2005 house

4 3.5 1714 2005 townhouse

2 2 1113 1999 condo

1 769 1999 condo 315,000

Page 15: Bootstrapping Machine Learning

Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)

3 1 860 1950 house 565,000

3 1 1012 1951 house

2 1.5 968 1976 townhouse 447,000

4 1315 1950 house 648,000

3 2 1599 1964 house

3 2 987 1951 townhouse 790,0001 1 530 2007 condo 122,0004 2 1574 1964 house 835,000

4 2001 house 855,000

3 2.5 1472 2005 house

4 3.5 1714 2005 townhouse

2 2 1113 1999 condo

1 769 1999 condo 315,000

Page 16: Bootstrapping Machine Learning
Page 17: Bootstrapping Machine Learning

Bedrooms Bathrooms Surface (foot²) Year built Type Price ($)

3 1 860 1950 house 565,000

3 1 1012 1951 house

2 1.5 968 1976 townhouse 447,000

4 1315 1950 house 648,000

3 2 1599 1964 house

3 2 987 1951 townhouse 790,0001 1 530 2007 condo 122,0004 2 1574 1964 house 835,000

4 2001 house 855,000

3 2.5 1472 2005 house

4 3.5 1714 2005 townhouse

2 2 1113 1999 condo

1 769 1999 condo 315,000

Page 18: Bootstrapping Machine Learning

ML is a set of AI techniques where “intelligence” is built by

referring to examples

Page 19: Bootstrapping Machine Learning
Page 20: Bootstrapping Machine Learning

??

Page 21: Bootstrapping Machine Learning

Prediction APIs to the rescue

Page 22: Bootstrapping Machine Learning
Page 23: Bootstrapping Machine Learning

HTML / CSS / JavaScript

Page 24: Bootstrapping Machine Learning

HTML / CSS / JavaScript

Page 25: Bootstrapping Machine Learning

squarespace.com

Page 26: Bootstrapping Machine Learning
Page 27: Bootstrapping Machine Learning
Page 28: Bootstrapping Machine Learning

The two phases of machine learning:

• TRAIN a model

• PREDICT with a model

Page 29: Bootstrapping Machine Learning

The two methods of prediction APIs:

• TRAIN a model

• PREDICT with a model

Page 30: Bootstrapping Machine Learning

The two methods of prediction APIs: • model = create_model(dataset)

• predicted_output = create_prediction(model, new_input)

Page 31: Bootstrapping Machine Learning

from bigml.api import BigML

# create a modelapi = BigML()source = api.create_source('training_data.csv')dataset = api.create_dataset(source)model = api.create_model(dataset)

# make a predictionprediction = api.create_prediction(model, new_input)print "Predicted output value: ",prediction['object']['output']

http://bit.ly/bigml_wakari

Page 32: Bootstrapping Machine Learning
Page 33: Bootstrapping Machine Learning

Automated Prediction APIs:

• BigML.com

• Google Prediction API

• WolframCloud.com

Page 34: Bootstrapping Machine Learning

Good Data

Page 35: Bootstrapping Machine Learning

• List assumptions (e.g. big houses are expensive)

• Browse data

• Plot data (with BigML for instance)

Page 36: Bootstrapping Machine Learning

bit.ly/5minPandas

Page 37: Bootstrapping Machine Learning

Model building

Page 38: Bootstrapping Machine Learning

bit.ly/5minML

Page 39: Bootstrapping Machine Learning

Evaluation:

• Train/test split

• Predictions accuracy

• Impact on app / UX /business

• Cross validation

• Time taken: training and predictions

Page 40: Bootstrapping Machine Learning

Recap

Page 41: Bootstrapping Machine Learning

• Classification and regression

• 2 phases in ML: train and predict

• Prediction APIs make it easy to build models, but need to work on data

• Evaluation: split data, measure accuracy, time, impact

• Limitations: # data points, # features and noise

Page 42: Bootstrapping Machine Learning
Page 43: Bootstrapping Machine Learning

www.louisdorard.com

@louisdorard

Page 44: Bootstrapping Machine Learning