K NOWING THE U NKNOWABLE T O KNOW THAT WE KNOW WHAT WE KNOW , AND TO KNOW THAT WE DO NOT KNOW WHAT WE DO NOT KNOW , THAT IS TRUE KNOWLEDGE . ― N ICOLAUS C OPERNICUS C ERTAIN INVENTIONS DISCLOSED IN THIS PRESENTATION MAY BE CLAIMED WITHIN PATENTS OWNED OR PATENT APPLICATIONS FILED BY V IVID C ORTEX , I NC .
37
Embed
MySQL Performance Metrics You Can't Measure - Using Regression Instead
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
KNOWING THE UNKNOWABLE
TO KNOW THAT WE KNOW WHAT WE KNOW, AND TO KNOW THAT WE DO NOT KNOW WHAT WE DO NOT KNOW, THAT IS TRUE KNOWLEDGE.
― NICOLAUS COPERNICUS
CERTAIN INVENTIONS DISCLOSED IN THIS PRESENTATION MAY BE CLAIMED WITHIN PATENTS OWNED OR PATENT APPLICATIONS FILED BY VIVIDCORTEX, INC.
LOGISTICS
SLIDES AND RECORDING WILL BE EMAILED TO YOU AFTER THE WEBINAR
TWEET QUESTIONS TO @VIVIDCORTEX AT ANY TIME (AND FOLLOW US!)
PLEASE OBSERVE THE LIGHTED EXIT SIGNS AND TURN OFF ALL ELECTRONIC DEVICES ;-)
CONSTRAINED REGRESSION SMOOTHING AND SAMPLING STEP REGRESSION LOCAL REGRESSION LOGISTIC REGRESSION DECISION TREES AND RANDOM FORESTS BAYESIAN REGRESSION; MLE ENSEMBLES MACHINE LEARNING WAVELET DECOMPOSITION AND FFT COMMERCIAL SOFTWARE
PROBLEMS
TOO COMPLEX; TOO GENERAL
TOO SLOW AND COSTLY; O(N2) IN X-VARS
PARTIAL RESULTS
FOOLED BY CORRELATED X-VARS
0 100 200 300 400 500 600 700
2e+0
63e
+06
4e+0
65e
+06
6e+0
67e
+06
Index
q.63
68bf
5907
564a
9f ti
me
0 100 200 300 400 500 600 7000
5000
1000
015
000
2000
0Index
e.78
5f8a
c3c1
ea1c
93 ti
me
0 100 200 300 400 500 600 700
5.0e
+07
1.0e
+08
1.5e
+08
Index
Serv
er C
PU ti
me
QUERY TIMES / SERVER CPU TIME
2e+06 4e+06 6e+06
5.0e
+07
1.5e
+08
Query q.6368bf5907564a9f time
Serv
er C
PU ti
me
0 5000 10000 15000 20000
5.0e
+07
1.5e
+08
Query e.785f8ac3c1ea1c93 time
Serv
er C
PU ti
me
QUERIES VS CPU
ORDINARY MULTIPLE LEAST-SQUARES REGRESSION
REQUIREMENTS
MEMORY & CPU EFFICIENT FOR LARGE DATASETS
FULL RESULTS
NO PRECOMPUTATION
REASONABLE ACCURACY
SIMPLE & PHYSICALLY REALISTIC
INSIGHTS
NON-NEGATIVE SLOPES
IDENTICAL DIMENSIONS
INDEPENDENC OF X-VARS
ALL VARS SIGNIFICANT
VARS ROUGHLY SIMILAR
ION CANNONS NOT NEEDED
WEIGHTED LINEAR REGRESSION
WEIGHTED LINEAR REGRESSION
0
250
500
750
1000
X-V Y-V Z-V
WEIGHTED LINEAR REGRESSION
0
250
500
750
1000
X-V Y-V Z-V
WEIGHTED LINEAR REGRESSION
0
250
500
750
1000
X-V Y-V Z-V
WEIGHTED LINEAR REGRESSION
0
250
500
750
1000
X-V Y-V Z-V
2e+06 4e+06 6e+06
5.0e
+07
1.5e
+08
Query q.6368bf5907564a9f time
Serv
er C
PU ti
me
0 5000 10000 15000 20000
5.0e
+07
1.5e
+08
Query e.785f8ac3c1ea1c93 time
Serv
er C
PU ti
me
2e+06 4e+06 6e+06
5.0e
+06
1.5e
+07
2.5e
+07
Query q.6368bf5907564a9f time
Allo
cate
d Se
rver
CPU
tim
e
0 5000 10000 15000 20000
020
000
4000
060
000
Query e.785f8ac3c1ea1c93 timeAl
loca
ted
Serv
er C
PU ti
me
RESULTS
METRICS OF QUALITY
R2
STANDARD ERROR; T-STATISTIC
F-STATISTIC
MAPE
RESIDUAL PLOTS
OUR APPROACH
DESCRIPTIVE STATS
VISUALIZATION
PREDICTION AND SCORING
SUBSETTING
DESCRIPTIVE STATS
QUERY CLASS SAMPLES R SLOPE T-VALUE INTERCEPT T-VALUE