Top Banner
Tying up loose ends
34

Tying up loose ends. Understand your data No answers available, only data.

Dec 27, 2015

Download

Documents

Daniella Park
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Tying up loose ends.  Understand your data  No answers available, only data.

Tying up loose ends

Page 2: Tying up loose ends.  Understand your data  No answers available, only data.

Understand your data

Page 3: Tying up loose ends.  Understand your data  No answers available, only data.

No answers available, only data

Page 4: Tying up loose ends.  Understand your data  No answers available, only data.

No answers available, only dataClustering, SOM, Hebbian learning,

PCA…

Page 5: Tying up loose ends.  Understand your data  No answers available, only data.

Training includes inputs and correct answers

Page 6: Tying up loose ends.  Understand your data  No answers available, only data.

Training includes inputs and correct answers

Perceptron, Backprop, POS tagging

Page 7: Tying up loose ends.  Understand your data  No answers available, only data.

Probability of Y given X

Page 8: Tying up loose ends.  Understand your data  No answers available, only data.

Probability of Y given XOr the most likely Y given X

Page 9: Tying up loose ends.  Understand your data  No answers available, only data.

Probability of Y given XOr the most likely Y given XCollaborative Filtering – people who

like X probably like Y

Page 10: Tying up loose ends.  Understand your data  No answers available, only data.

Probability of Y given XOr the most likely Y given XCollaborative Filtering – people who

like X probably like YNeural Networks – input X triggers Y

output (behaviorism)

Page 11: Tying up loose ends.  Understand your data  No answers available, only data.

Input retrieves similarities or correlations as output

Page 12: Tying up loose ends.  Understand your data  No answers available, only data.

X is a…

Page 13: Tying up loose ends.  Understand your data  No answers available, only data.

X is a…X is A or B or C or D

Page 14: Tying up loose ends.  Understand your data  No answers available, only data.

X is a…X is A or B or C or DX is 1 or 0

Page 15: Tying up loose ends.  Understand your data  No answers available, only data.

X is a…X is A or B or C or DX is face or not-face

Page 16: Tying up loose ends.  Understand your data  No answers available, only data.

Goal is prediction

Page 17: Tying up loose ends.  Understand your data  No answers available, only data.

Goal is predictionClassification is a type of association

Page 18: Tying up loose ends.  Understand your data  No answers available, only data.

Goal is predictionClassification is a type of association Includes pattern recognition: OCR,

faces, diagnosis, speech, NLP…

Page 19: Tying up loose ends.  Understand your data  No answers available, only data.

Goal is predictionClassification is a type of association Includes pattern recognition: OCR,

faces, diagnosis, speech, NLP… Includes compression

Page 20: Tying up loose ends.  Understand your data  No answers available, only data.

If the output is a continuous number

Page 21: Tying up loose ends.  Understand your data  No answers available, only data.

If the output is a continuous numberEx. Automatic steering

inputs: sensors (video, GPS, proximity…)

output: degree of rotation of the wheel Ex. ALVINN

Page 22: Tying up loose ends.  Understand your data  No answers available, only data.

Backprop Neural Nets work for both

Page 23: Tying up loose ends.  Understand your data  No answers available, only data.

Different algorithms use different error calculations

Page 24: Tying up loose ends.  Understand your data  No answers available, only data.

Different algorithms use different error calculations

Simplest : # wrong / # total ie. 2/5 = .4 or 40%

Page 25: Tying up loose ends.  Understand your data  No answers available, only data.

Different algorithms use different error calculations

Simplest : # wrong / # total ie. 2/5 = .4 or 40%

Other examples: WER Mean Squared Error

Page 26: Tying up loose ends.  Understand your data  No answers available, only data.

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

Page 27: Tying up loose ends.  Understand your data  No answers available, only data.

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput Validation

Training

Page 28: Tying up loose ends.  Understand your data  No answers available, only data.

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

Fold 1

Fold 2

Fold 3

Fold 4

Fold 5

Page 29: Tying up loose ends.  Understand your data  No answers available, only data.

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

Fold 1

Fold 2

Fold 3

Fold 4

Fold 5

Train

Test

-> Learner 1 error = .01

Page 30: Tying up loose ends.  Understand your data  No answers available, only data.

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

Fold 1

Fold 2

Fold 3

Fold 5

Fold 4

Train

Test

-> Learner 2 error = .012

Page 31: Tying up loose ends.  Understand your data  No answers available, only data.

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

InputInput OutputOutputInputInput OutputOutput

Fold 1

Fold 2

Fold 3

Fold 5

Fold 3

Train

Test

-> Learner 3 error = .011

Page 32: Tying up loose ends.  Understand your data  No answers available, only data.

If errors between folds vary greatly this indicated bias in training

Page 33: Tying up loose ends.  Understand your data  No answers available, only data.

Over-fitting – too much training

Page 34: Tying up loose ends.  Understand your data  No answers available, only data.

Over-fittingMisrepresentative data