Why Data Science is Something You Should Care About Presented @ South Dakota Code Camp 2012 Ryan Swanstrom @swgoof
Learn Data Science

Jan 15, 2015

Big Data and Data Science are hot buzzwords right now. The buzzwords might go away, but the ideas will not. This talk explains the buzzwords and covers some of the best resources for learning data science skills.
Transcript
Page 1: Learn Data Science

Why Data Science is Something You Should Care About

Presented @ South Dakota Code Camp 2012

Ryan Swanstrom @swgoof

Page 2: Learn Data Science

About Ryan Swanstrom

Find me on the web

http://twitter.com/swgoof

http://linkedin.com/in/ryanswanstrom

http://datascience101.wordpress.com/

Page 3: Learn Data Science

Data Science

"[ability to] obtain, scrub, explore, model and interpret data, blending hacking, statistics, and machine learning."

definition by Hilary Mason, Chief Scientist @ Bit.ly

Page 5: Learn Data Science

Who is a data scientist?

http://onforb.es/WNLnRu

Page 6: Learn Data Science

Big Data

Any dataset where the size or speed of incoming data causes difficulties in processing

● Volume
● Velocity
● Variety

Page 7: Learn Data Science

Hadoop

"[...] a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models."

Apache Hadoop Website

● HDFS - Hadoop Distributed File System
● MapReduce
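
To make the "simple programming models" in the quote a little more concrete, here is a minimal single-machine sketch of the MapReduce idea in plain Python, using the classic word-count example. It does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration. In a real cluster the framework would run the map and reduce steps on many machines and handle the grouping itself.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one document."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group emitted values by key (done here in memory for illustration)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce over an iterable of text documents."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data", "data science"]))
```

The point of the model is that map_phase and reduce_phase only ever see one document or one key at a time, which is what lets a framework spread the work across a cluster.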

Page 8: Learn Data Science

Lots of Data

18 Months: the amount of time for digital data to double
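
As a rough illustration of what an 18-month doubling time implies, the sketch below computes the growth factor after a given number of months. It assumes the doubling period stays constant, which is a simplification; the function name growth_factor and its parameters are hypothetical and not from the talk.

```python
def growth_factor(months, doubling_period_months=18):
    """Return how many times larger the data is after `months`,
    assuming it doubles every `doubling_period_months` months."""
    return 2 ** (months / doubling_period_months)


if __name__ == "__main__":
    # Three years is two doubling periods, six years is four.
    for months in (18, 36, 72):
        print(months, "months ->", growth_factor(months), "x")
```

Under that assumption, data grows 4x in three years and 16x in six.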

Page 9: Learn Data Science

Data Products

Page 10: Learn Data Science

Why Do You Care?

McKinsey Global Big Data Report

● 140k - 190k Unfilled Jobs by 2018

● 1.5M Managers & Analysts

Page 12: Learn Data Science

Now That You Care, What Skills?

1. Machine Learning
2. Statistics
3. Story Telling (Communication)
4. Big Data
5. Algorithms
6. Curiosity

Page 14: Learn Data Science

College and University

Pros

● Credentials
● Experts
● Familiar
● Widely Accepted
● Structured

Cons

● Expensive
● Not Individualized
● School
● Lengthy
● Inflexible
● Not Real World

Page 16: Learn Data Science

Corporate Training

Pros

● Short Timeframe
● Experts
● Certificates
● Business-Savvy
● Real World
● Structured

Cons

● Expensive
● Not Individualized
● Product Focused
● Sales Pitch

Page 17: Learn Data Science

MOOCs (Massive Open Online Courses)

Page 18: Learn Data Science

MOOCs (Massive Open Online Courses)

Pros

● Free
● Experts
● Flexible

Cons

● No Credentials
● Single Course
● No Programs (Yet)

Page 20: Learn Data Science

Blogs/Wikis/Other

Pros

● Free
● Very Specific
● Short
● Lots of them

Cons

● Quality?
● No Credentials
● No Structure
● Too many!

Page 21: Learn Data Science

Blogs/Wikis/Other

The Problem

● What content is good?

● What order should I cover the content?

● Where do I find new content?

● Who can help me understand?

Page 22: Learn Data Science

Data Science 201 - coming soon

http://www.datascience201.com

Helping you find the best data science learning content!

Thank You