Top Banner
Copyright © 2010 SAS Institute Inc. All rights reserved. DASI: Analytics in Practice and Academic Analytics Preparation Mia Stephens – [email protected]
19

DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

Jun 30, 2018

Download

Documents

leque
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

Copyright  ©  2010  SAS  Institute  Inc.  All  rights  reserved.

DASI: Analytics in Practice and Academic Analytics PreparationMia Stephens – [email protected]

Page 2: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

2

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Background

§ TQM  Coordinator/Six  Sigma  MBB

§ Founding  Partner,  Statistical  Trainer  and  Consultant,  North  Haven  Group

§ Senior  Consultant  and  Trainer,  George  Group  (now  Accenture)

§ Adjunct  Professor  Statistics,  University  of  New  Hampshire

§ Academic  Ambassador,  JMP  Academic  Team

§ Author

Page 3: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

3

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Background

Page 4: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

4

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

What is Analytics?“…an  encompassing  and  multidimensional  field  that  uses  mathematics,  statistics,  predictive  modeling  and  machine-­learning  techniques  to  find  meaningful  patterns  and  knowledge in  recorded  data.”  

§ Descriptive/explanatory  statistics. Understand  what  happened  in  the  past  and  why.  

§ Predictive  analytics.  Using  past  data  and  predictive  algorithms  to  determine  what  will  happen  next.  

§ Prescriptive  analytics. Answering  the  question  of  what  to  do,  by  providing  information  on  optimal  decisions  based  on  the  predicted  future  scenarios.  

From  SAS  Analytics:    http://www.sas.com/en_us/insights/analytics/what-­is-­analytics.html

Page 5: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

5

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Analytics Frameworks§ SEMMA  (SAS)

§ Sample§ Explore§ Modify§ Model§ Assess

§ CRISP-­DM  (Cross  Industry  Standard  Process  for  Data  Mining)§ Business  Understanding§ Data  Understanding§ Data  Preparation§ Modeling§ Evaluation§ Deployment

Page 6: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

6

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

The Business Analytics Process

Define  the  Problem

Prepare  for  Modeling

Modeling

Deploy  Model

Monitor  Performance

Business  Problem

BusinessAnalyticsProcess

From  Building  Better  Models  with  JMP  Pro,  Grayson,  Gardner  and  Stephens,  2015.

May  loop  back  at  any  step

Page 7: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

7

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

The Business Analytics Process

Define  the  Problem

Prepare  for  Modeling

Modeling

Deploy  Model

Monitor  Performance

Business  Problem

BusinessAnalyticsProcess

From  Building  Better  Models  with  JMP  Pro,  Grayson,  Gardner  and  Stephens,  2015.

May  loop  back  at  any  stepThe  focus  

of  most  courses?    

What  do  we  like  to  teach?

Page 8: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

8

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Key  Tools:• Multiple  Regression• Logistic  Regression• Naïve  Bayes• kNN• Classification  and  Regression  Trees• Bootstrap  Forests  and  Boosted  Trees

• Neural  Networks• Generalized  Linear  Models• Survival  Models• Forecasting/Time  Series• Model  Comparison• Text  Mining

Modeling: Activities and ToolsKey  Activities:• Choose  the  appropriate  modeling  method  or  methods

• Fit  one  or  more  models• Evaluate  the  performance  of  each  model  using  validation  statistics  (misclassification,  RMSE,  Rsquare)

• Choose  the  best  model  or  set  of  models  to  address  the  analytics  problem  (and  ultimately  the  business  problem)

• **Create  ensemble  models

Page 9: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

9

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

The Business Analytics Process

Define  the  Problem

Prepare  for  Modeling

Modeling

Deploy  Model

Monitor  Performance

Business  Problem

BusinessAnalyticsProcess

From  Building  Better  Models  with  JMP  Pro,  Grayson,  Gardner  and  Stephens,  2015.

May  loop  back  at  any  stepWhat  is  the  

most  time-­consuming  step?

Page 10: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

10

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Key  Tools:• SQL,  data  import• Data  table  structuring  -­ join,  concatenate,  update,  stack,  summarize,…

• Summary  statistics  and  graphical  displays,  interactive  tools  and  filtering  

• Multivariate  procedures  (clustering,  PCA,…)

• Transformations,  creating  derived  variables,  recoding,  binning

• Addressing  missing  data  and  outliers

• Creating  holdout  set(s)

Data Preparation: Activities and ToolsKey  Activities:• Determine  which  data  are  needed

• Compile  (or  collect  new)  data• Explore,  examine  and  understand  data

• Assess  data  quality• Clean  and  transform  data• Define  features• Reduce  dimensionality• Create  training,  validation  and  test  sets

Page 11: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

11

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

The Business Analytics Process

Define  the  Problem

Prepare  for  Modeling

Modeling

Deploy  Model

Monitor  Performance

Business  Problem

BusinessAnalyticsProcess

From  Building  Better  Models  with  JMP  Pro,  Grayson,  Gardner  and  Stephens,  2015.

May  loop  back  at  any  stepMost  neglected  

topics  – both  academically  and  in  practice?

Page 12: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

12

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Define the Problem

§ Understand  the  business  problem(s)  and  objectives

§ ROI

§ Frame  the  analytics  problem  and  objectives

§ Define  project  goals

§ Develop  a  project  plan

§ Obtain  resources,  buy-­in,  approvals

Page 13: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

13

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

• Deliver  the  model  and  model  results  to  the  business  or  internal  customers

• Communicate  results  (graphs  and  profilers,  summaries,  explore  “what  if”  scenarios)

• Assist  in  applying  model  insights  and  implementing  ongoing  use  of  the  model  

• Document  the  project• Close  out  the  project• Evaluate  and  quantify  improvement• Revise  model• Identify  additional  opportunities/problems

Deploy Models and Monitor Performance

Page 14: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

14

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Best Practices from Top Programs

How  to  provide  context  and  real-­world  experience?

§ Focus  on  application  to  real  business  problems§ Method-­specific  case  studies

§ Capstone  projects

§ Work  in  cross-­functional  teams

§ Industry  partnerships  and  projects

§ Internships

Page 15: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

15

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Best Practices from Top Programs

Analytics  is  different  from  statistics§ Not  statistics  programs/courses  simply  rebranded  or  

renamed

§ Heavy  in  application  versus  theory,  equations  and  computation

§ Start  from  the  ground  up  in  designing  programs/courses

Page 16: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

16

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Best Practices from Top Programs

Predictive  analytics  is  different  from  descriptive  or  explanatory  modeling

§ De-­emphasize:    » hand-­computation

» p-­values

» rigorous  adherence  to  meeting  assumptions

» measures  of  goodness-­of-­fit

§ Emphasize:    model  accuracy  and  predictive  ability  on  holdout  sample

Page 17: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

17

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Feedback from Industry

Most  important  skill  -­ data  understanding  and  intuition  § More  important  than  understanding  of  theories  and  equations

Students  need  to  be  able  to:§ think  with  data§ know  which  methods  to  apply  and  when§ to  understand  and  communicate  the  story  in  the  data

Page 18: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

18

Copyright  ©  2010,  SAS  Institute  Inc.  All  rights  reserved.

Feedback from Industry

Learning  software/programming  language  is  secondary  to  learning  the  concepts  and  methods

§ The  end  goal  is  not  to  teaching  statistical  software  or  programming

§ “We  can  teach  the  software  we  use,  we  can’t  teach  data  intuition”  

§ Software  should  facilitate  learning

§ Toolkit  – use  the  tools  that  best  support  the  learning  objectives  (exploration  of  concepts,  development  of  models,  deployment,  communication,…)

Page 19: DASI: Analytics in Practice and Academic Analytics …randrews/DASI/DSI2016/Mia_JMP_2016...Title _2016 DSI DASI Created Date 20161122220559Z

Copyright  ©  2010  SAS  Institute  Inc.  All  rights  reserved.

[email protected]/academic

Thank you