Python and Data Analysis. 2017 Copyright @ Data Application Lab. 2017-04-15. Course info session (open introductory class).
Jun 17, 2020


dariahiddleston
Page 1

Python and Data Analysis

2017 Copyright @ Data Application Lab 2017-04-15

Course info session

Open class introducing the course

Page 2

Agenda

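• Introduction to Data Application Lab
• Python & Data Analysis course outline
• Josh (instructor): Python and the course Capstone Project
• Jason (instructor): the role of Python in data science
• Frank (instructor): the role of Python in financial data analysis
• Kai (former student): how a former student uses Python to solve real problems
• Q&A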

Page 3

Data Application Lab


Page 4

FYI:
• https://www.dataapplab.com/about-us/
• Partner: Data Science Association
• http://socaldatascience.org/ 2016.9 @ Southern California
• http://www.datascienceassociations.org/
• 2017.2 @ Dallas, 2017.5 @ Chicago, 2017.10 @ Pasadena
• Facebook: https://www.facebook.com/dataapplab/
• LinkedIn: https://www.linkedin.com/company/7956968
• WeChat: 大数据应用

Page 5

Syllabus

http://dataapplab.com/python

A 5-week course: 3 weeks learning Python from the basics, then 2 weeks of hands-on data analysis (financial quant analysis + Web Project)

Page 6

Introducing Python

Josh

Your First Programming Language

Page 7

Python Popularity: Your first programming language

✓ Python is NOW the most popular introductory teaching language at top U.S. universities.

✓ Eight of the top 10 CS departments (80%), and 27 of the top 39 (69%), teach Python in introductory CS0 or CS1 courses.

✓ The three largest MOOC providers (edX, Coursera, and Udacity) all offer introductory programming courses in Python.

Source: Communications of the ACM, http://cacm.acm.org/blogs/blog-cacm/176450-python-is-now-the-most-popular-introductory-teaching-language-at-top-u-s-universities/fulltext

Page 8

Python Popularity: Job opportunities

[Bar chart: 2016 Average Developer Salary in the U.S., indeed.com estimation ($). Languages shown: Swift, Python, C++, Java C, JavaScript, C, SQL, PHP. Bar values range from 89,000 to 115,000.]

Source: codementor, https://www.codementor.io/learn-programming/beginner-programming-language-job-salary-community

Page 9

Python Popularity: The developer supply market

[Bar chart: Codementors Expertise Distribution. Languages shown: Ruby, Python, Java, JavaScript, C#, SQL, PHP, C++, C, Objective-C. Bar values range from 6% to 48%.]

Source: codementor, https://www.codementor.io/learn-programming/beginner-programming-language-job-salary-community

Page 10

Python Popularity: Some famous platforms and software that rely on Python

Page 11

Python Popularity: Python is good for data analysis

✓ TensorFlow:
• Python
• C++
• Go (experimental)

✓ Other platforms:
• Caffe: Python
• PaddlePaddle: Python, C++
• Java: Almost the only choice is dl4j

Page 12

Python Web

Josh

Project description

Page 13

Why Web Development (offline vs. online)

• Build Better Products
• Make Better Decisions

Page 14

Python Ecosystem

• Data Analysis & Visualization
• Web Development

Page 15

Library, Framework

Page 16

Web Development Full Stack

• Front End: HTML5, JavaScript, CSS
• Back End: Python, Node.js, Ruby on Rails
• Data Sets: MySQL, MongoDB
• Infrastructure: Servers, Cloud (AWS, Google, Azure), Container

Page 17

We will learn MVC Architecture
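
As a rough illustration of what MVC (Model, View, Controller) separation can look like, here is a minimal plain-Python sketch. It is not the course's actual project code: the class names `PriceModel`, `TextView`, and `PriceController` and the price-averaging task are hypothetical, and no web framework is involved.

```python
from dataclasses import dataclass, field


@dataclass
class PriceModel:
    """Model: owns the data and the analysis logic (hypothetical example)."""
    prices: list = field(default_factory=list)

    def add_price(self, value: float) -> None:
        self.prices.append(float(value))

    def average(self) -> float:
        if not self.prices:
            raise ValueError("no prices recorded")
        return sum(self.prices) / len(self.prices)


class TextView:
    """View: turns model data into something a user can read."""

    @staticmethod
    def render(count: int, average: float) -> str:
        return f"{count} prices, average {average:.2f}"


class PriceController:
    """Controller: receives a request, updates the model, asks the view to render."""

    def __init__(self, model: PriceModel, view: TextView) -> None:
        self.model = model
        self.view = view

    def handle_new_price(self, raw_value: str) -> str:
        # Input arrives as text, as it would from a web form.
        self.model.add_price(float(raw_value))
        return self.view.render(len(self.model.prices), self.model.average())


controller = PriceController(PriceModel(), TextView())
controller.handle_new_price("10")
page = controller.handle_new_price("20.5")
print(page)
```

In a web framework the controller role is typically played by request handlers and the view by templates; the point of the split is that the analysis code in the model does not need to know how results are displayed.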

Page 18

Course Purpose

• Understand basic web concepts
• Learn the Flask framework
• Understand how a data analysis model is applied in a web application

Page 19

Project Demo Example:

Page 20

Python in Data Science

Jason

Page 21

Why Data Science?

Business Requirement

Data Acquisition

Data Preparation

Hypothesis & Modeling

Evaluation & Interpretation

Deployment

Operations

Optimization

Python

Page 22

Why use Python?

Everywhere in industry for all kinds of jobs:
• National Labs
• Finance
• Oil & Gas
• Consumer Products
• Aerospace/Defense
• Data Analytics
• School teaching
• Mini Projects

Page 23

Python common toolkits in Data Science

• numpy: basic array manipulation
• scipy: scientific computing in Python, including signal processing and optimization
• matplotlib: visualization and plotting
• IPython: write and run Python code interactively in a shell or a notebook
• pandas: data manipulation
• scikit-learn: machine learning

Page 24

Data Science Skill Tree

• Machine Learning
• Data Collection
• Communication & Storytelling
• Data Wrangling
• Product Development & Feedback Analysis
• Data Visualization
• Statistics
• Domain Knowledge & Business Mindset

Need Python

Page 25

Data (Quant) Analysis

Frank

Brief introduction

Page 26

Python in Analysis?

• HFT: AQR, KCG, Citadel
• Asset Management: BlackRock, Vanguard
• Banking: Goldman Sachs, JP Morgan
• Rating Agency: S&P, Moody's
• Bloomberg

Page 27

C++ is used everywhere; Python is widely used in the financial industry.

• Goldman Sachs: Slang
• Morgan Stanley: Q
• JP Morgan: Python
• Bank of America Merrill Lynch: Python
• Jane Street: OCaml
• Two Sigma: Java
• AQR: Python
• KCG: C++/Python

Page 28

For OOP fans

• Interested in coding.
• Know how to prepare for a programming interview.

Page 29

What else?

• Stochastic calculus
• Machine learning
• Time series, probability, statistics
• Derivatives
• Data structures and algorithms, Linux

Page 30

Page 31

Course

• Monte Carlo with variance reduction
• European and American options
• VaR and CVA
• PCA in finance
• Black-Litterman model
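
To give a flavor of the first two topics, here is a minimal sketch of Monte Carlo pricing with one common variance-reduction technique, antithetic variates, applied to a European call. It is not taken from the course material: the function names, the parameter values, and the choice of antithetic variates are all assumptions, and the Black-Scholes closed form is included only as a sanity check. Only the standard library is used.

```python
import math
import random
import statistics


def black_scholes_call(s0, k, r, sigma, t):
    """Closed-form Black-Scholes price of a European call (reference value)."""
    d1 = (math.log(s0 / k) + (r + 0.5 * sigma**2) * t) / (sigma * math.sqrt(t))
    d2 = d1 - sigma * math.sqrt(t)
    cdf = statistics.NormalDist().cdf
    return s0 * cdf(d1) - k * math.exp(-r * t) * cdf(d2)


def mc_call_antithetic(s0, k, r, sigma, t, n_pairs, seed=0):
    """Monte Carlo price of a European call using antithetic variates.

    Each normal draw z is reused as -z, and the two payoffs are averaged.
    The pair is negatively correlated, which lowers the estimator's variance.
    """
    rng = random.Random(seed)
    drift = (r - 0.5 * sigma**2) * t
    vol = sigma * math.sqrt(t)
    total = 0.0
    for _ in range(n_pairs):
        z = rng.gauss(0.0, 1.0)
        payoff_plus = max(s0 * math.exp(drift + vol * z) - k, 0.0)
        payoff_minus = max(s0 * math.exp(drift - vol * z) - k, 0.0)
        total += 0.5 * (payoff_plus + payoff_minus)
    # Discount the average payoff back to today.
    return math.exp(-r * t) * total / n_pairs


# Hypothetical inputs: at-the-money call, 5% rate, 20% volatility, 1 year.
bs_price = black_scholes_call(100.0, 100.0, 0.05, 0.2, 1.0)
mc_price = mc_call_antithetic(100.0, 100.0, 0.05, 0.2, 1.0, n_pairs=20000)
print(f"Black-Scholes: {bs_price:.4f}  Monte Carlo: {mc_price:.4f}")
```

American options need a different method (for example a binomial tree or least-squares Monte Carlo) because of early exercise, so they are outside this sketch.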

Page 32

After course

• You can answer interview questions like:
1. What's the difference between PCA and Linear Regression?
2. When do we need PCA?
3. What kind of data do we need before PCA?
4. How do we explain the result of PCA? What does COMPONENT mean? What does eigenvector mean?
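
For question 4, a small worked example may help. The sketch below runs PCA on two-dimensional data using only the standard library: it centers the data, builds the 2x2 sample covariance matrix, and takes its eigenvalues and eigenvectors in closed form. The function name `pca_2d` and the sample points are hypothetical; in practice one would more likely use numpy or scikit-learn, which handle any number of dimensions.

```python
import math


def pca_2d(points):
    """PCA for 2-D data via the closed-form eigen-decomposition of the
    2x2 sample covariance matrix. Returns (eigenvalues, components),
    ordered from largest to smallest eigenvalue."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    # Center the data: PCA describes variation around the mean.
    xs = [p[0] - mean_x for p in points]
    ys = [p[1] - mean_y for p in points]
    sxx = sum(x * x for x in xs) / (n - 1)
    syy = sum(y * y for y in ys) / (n - 1)
    sxy = sum(x * y for x, y in zip(xs, ys)) / (n - 1)
    # Eigenvalues of the symmetric matrix [[sxx, sxy], [sxy, syy]].
    half_trace = 0.5 * (sxx + syy)
    radius = math.hypot(0.5 * (sxx - syy), sxy)
    eigenvalues = [half_trace + radius, half_trace - radius]
    # Angle of the first principal axis; the second axis is perpendicular.
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    first = (math.cos(theta), math.sin(theta))
    second = (-math.sin(theta), math.cos(theta))
    return eigenvalues, [first, second]


# Hypothetical data lying close to the line y = 2x.
points = [(1, 2.0), (2, 4.1), (3, 5.9), (4, 8.2), (5, 9.8)]
eigenvalues, components = pca_2d(points)
explained = eigenvalues[0] / sum(eigenvalues)
print("first component direction:", components[0])
print("share of variance explained by it:", round(explained, 4))
```

Each component is an eigenvector of the covariance matrix, that is, a direction in the original feature space, and its eigenvalue is the variance of the data along that direction. Here the first component points roughly along the line y = 2x and carries almost all of the variance.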

Page 33

After course

• Project:
• PCA in DAX30 replication for ETF product
• European and American options

Page 34

Our Student Story

Kai

Python project to crawl data

Page 35

Q&A?

Page 36

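How to register for the course:

• http://dataapplab.com/python
• Register directly on the web page and we will contact you
• Online chat on the web page
• Register by email: send your personal information to [email protected]
• Phone inquiries: +1 (800) 485 – 7918
• WeChat official account: “大数据应用”
• Add us on WeChat for a private chat (WeChat ID: data-science)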