Top Banner
Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart 1 Princeton September 12, 2018 1 These slides are heavily influenced by Matt Blackwell and Adam Glynn with contributions from Justin Grimmer and Matt Salganik. Illustrations by Shay O’Brien. Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 1 / 60
273

Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mar 24, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Soc500: Applied Social Statistics

Week 1: Introduction and Probability

Brandon Stewart1

Princeton

September 12, 2018

1These slides are heavily influenced by Matt Blackwell and Adam Glynn with contributionsfrom Justin Grimmer and Matt Salganik. Illustrations by Shay O’Brien.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 1 / 60

Page 2: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Where We’ve Been and Where We’re Going...

Last WeekI methods campI pre-grad school life

This WeekI Wednesday

F welcomeF basics of probability

Next WeekI random variablesI joint distributions

Long RunI probability → inference → regression → causal inference

Questions?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 2 / 60

Page 3: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysisI . . . love doing collaborative researchI . . . talk very quickly

Your PreceptorsI sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 4: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysisI . . . love doing collaborative researchI . . . talk very quickly

Your PreceptorsI sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 5: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

I

I . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysisI . . . love doing collaborative researchI . . . talk very quickly

Your PreceptorsI sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 6: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.

I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysisI . . . love doing collaborative researchI . . . talk very quickly

Your PreceptorsI sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 7: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statistics

I . . . do research in methods and statistical text analysisI . . . love doing collaborative researchI . . . talk very quickly

Your PreceptorsI sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 8: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysis

I . . . love doing collaborative researchI . . . talk very quickly

Your PreceptorsI sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 9: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysisI . . . love doing collaborative research

I . . . talk very quickly

Your PreceptorsI sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 10: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysisI . . . love doing collaborative researchI . . . talk very quickly

Your PreceptorsI sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 11: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysisI . . . love doing collaborative researchI . . . talk very quickly

Your Preceptors

I sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 12: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysisI . . . love doing collaborative researchI . . . talk very quickly

Your PreceptorsI sage guides of all things

I Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 13: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Welcome and Introductions

The tale of two classes: Soc400/Soc500 Applied Social Statistics

II . . . am an Assistant Professor in Sociology.I . . . am trained in political science and statisticsI . . . do research in methods and statistical text analysisI . . . love doing collaborative researchI . . . talk very quickly

Your PreceptorsI sage guides of all thingsI Shay O’Brien (Soc500)I Alex Kindel (Soc400)I Ziyao Tian (Soc400)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 3 / 60

Page 14: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

1 Welcome

2 Goals

3 Ways to Learn

4 Core Ideas

5 Introduction to ProbabilityWhat is Probability?Sample Spaces and EventsProbability FunctionsMarginal, Joint and Conditional ProbabilityBayes’ RuleIndependence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 4 / 60

Page 15: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

1 Welcome

2 Goals

3 Ways to Learn

4 Core Ideas

5 Introduction to ProbabilityWhat is Probability?Sample Spaces and EventsProbability FunctionsMarginal, Joint and Conditional ProbabilityBayes’ RuleIndependence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 4 / 60

Page 16: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Overview

Goal: train you in statistical thinking

First in a two course sequence replication project and longerarc

Difficult course but with many resources to support you.

When we are done you will be able to teach yourself many things

Syllabus is a useful resource including philosophy of the class.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 5 / 60

Page 17: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Overview

Goal: train you in statistical thinking

First in a two course sequence replication project and longerarc

Difficult course but with many resources to support you.

When we are done you will be able to teach yourself many things

Syllabus is a useful resource including philosophy of the class.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 5 / 60

Page 18: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Overview

Goal: train you in statistical thinking

First in a two course sequence replication project and longerarc

Difficult course but with many resources to support you.

When we are done you will be able to teach yourself many things

Syllabus is a useful resource including philosophy of the class.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 5 / 60

Page 19: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Overview

Goal: train you in statistical thinking

First in a two course sequence replication project and longerarc

Difficult course but with many resources to support you.

When we are done you will be able to teach yourself many things

Syllabus is a useful resource including philosophy of the class.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 5 / 60

Page 20: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Overview

Goal: train you in statistical thinking

First in a two course sequence replication project and longerarc

Difficult course but with many resources to support you.

When we are done you will be able to teach yourself many things

Syllabus is a useful resource including philosophy of the class.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 5 / 60

Page 21: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Overview

Goal: train you in statistical thinking

First in a two course sequence replication project and longerarc

Difficult course but with many resources to support you.

When we are done you will be able to teach yourself many things

Syllabus is a useful resource including philosophy of the class.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 5 / 60

Page 22: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Specific Goals

For the semesterI critically read, interpret and replicate the quantitative content

of many articles in the quantitative social sciencesI conduct, interpret, and communicate results from analysis using

multiple regressionI explain the limitations of observational data for making causal

claimsI write clean, reusable, and reliable R code.I feel empowered working with data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 6 / 60

Page 23: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Specific Goals

For the semester

I critically read, interpret and replicate the quantitative contentof many articles in the quantitative social sciences

I conduct, interpret, and communicate results from analysis usingmultiple regression

I explain the limitations of observational data for making causalclaims

I write clean, reusable, and reliable R code.I feel empowered working with data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 6 / 60

Page 24: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Specific Goals

For the semesterI critically read, interpret and replicate the quantitative content

of many articles in the quantitative social sciences

I conduct, interpret, and communicate results from analysis usingmultiple regression

I explain the limitations of observational data for making causalclaims

I write clean, reusable, and reliable R code.I feel empowered working with data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 6 / 60

Page 25: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Specific Goals

For the semesterI critically read, interpret and replicate the quantitative content

of many articles in the quantitative social sciencesI conduct, interpret, and communicate results from analysis using

multiple regression

I explain the limitations of observational data for making causalclaims

I write clean, reusable, and reliable R code.I feel empowered working with data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 6 / 60

Page 26: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Specific Goals

For the semesterI critically read, interpret and replicate the quantitative content

of many articles in the quantitative social sciencesI conduct, interpret, and communicate results from analysis using

multiple regressionI explain the limitations of observational data for making causal

claims

I write clean, reusable, and reliable R code.I feel empowered working with data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 6 / 60

Page 27: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Specific Goals

For the semesterI critically read, interpret and replicate the quantitative content

of many articles in the quantitative social sciencesI conduct, interpret, and communicate results from analysis using

multiple regressionI explain the limitations of observational data for making causal

claimsI write clean, reusable, and reliable R code.

I feel empowered working with data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 6 / 60

Page 28: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Specific Goals

For the semesterI critically read, interpret and replicate the quantitative content

of many articles in the quantitative social sciencesI conduct, interpret, and communicate results from analysis using

multiple regressionI explain the limitations of observational data for making causal

claimsI write clean, reusable, and reliable R code.I feel empowered working with data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 6 / 60

Page 29: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why R?

It will give you super powers (but not at first)

It is free and open source

It is the de facto standard in many applied statistical fields

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 7 / 60

Page 30: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why R?

It will give you super powers (but not at first)

It is free and open source

It is the de facto standard in many applied statistical fields

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 7 / 60

Page 31: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why R?

It will give you super powers (but not at first)

It is free and open source

It is the de facto standard in many applied statistical fields

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 7 / 60

Page 32: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why RMarkdown?What you’ve done before

Image Credit: Baumer et al (2014)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 8 / 60

Page 33: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why RMarkdown?

RMarkdown

Image Credit: Baumer et al (2014)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 9 / 60

Page 34: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

1 Welcome

2 Goals

3 Ways to Learn

4 Core Ideas

5 Introduction to ProbabilityWhat is Probability?Sample Spaces and EventsProbability FunctionsMarginal, Joint and Conditional ProbabilityBayes’ RuleIndependence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 10 / 60

Page 35: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

1 Welcome

2 Goals

3 Ways to Learn

4 Core Ideas

5 Introduction to ProbabilityWhat is Probability?Sample Spaces and EventsProbability FunctionsMarginal, Joint and Conditional ProbabilityBayes’ RuleIndependence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 10 / 60

Page 36: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mathematical Prerequisites

No formal pre-requisites

Balancing rigor and intuitionI no rigor for rigor’s sakeI we will tell you why you need the math, but also feel free to askI course focus on how to reason about statistics, not just

memorize guidelines

We will teach you any math you need as we go along

Crucially though- this class is not about innate statisticalaptitude, it is about effort

We all come from very different backgrounds. Please havepatience with yourself and with others.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 11 / 60

Page 37: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mathematical Prerequisites

No formal pre-requisites

Balancing rigor and intuitionI no rigor for rigor’s sakeI we will tell you why you need the math, but also feel free to askI course focus on how to reason about statistics, not just

memorize guidelines

We will teach you any math you need as we go along

Crucially though- this class is not about innate statisticalaptitude, it is about effort

We all come from very different backgrounds. Please havepatience with yourself and with others.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 11 / 60

Page 38: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mathematical Prerequisites

No formal pre-requisites

Balancing rigor and intuition

I no rigor for rigor’s sakeI we will tell you why you need the math, but also feel free to askI course focus on how to reason about statistics, not just

memorize guidelines

We will teach you any math you need as we go along

Crucially though- this class is not about innate statisticalaptitude, it is about effort

We all come from very different backgrounds. Please havepatience with yourself and with others.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 11 / 60

Page 39: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mathematical Prerequisites

No formal pre-requisites

Balancing rigor and intuitionI no rigor for rigor’s sake

I we will tell you why you need the math, but also feel free to askI course focus on how to reason about statistics, not just

memorize guidelines

We will teach you any math you need as we go along

Crucially though- this class is not about innate statisticalaptitude, it is about effort

We all come from very different backgrounds. Please havepatience with yourself and with others.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 11 / 60

Page 40: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mathematical Prerequisites

No formal pre-requisites

Balancing rigor and intuitionI no rigor for rigor’s sakeI we will tell you why you need the math, but also feel free to ask

I course focus on how to reason about statistics, not justmemorize guidelines

We will teach you any math you need as we go along

Crucially though- this class is not about innate statisticalaptitude, it is about effort

We all come from very different backgrounds. Please havepatience with yourself and with others.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 11 / 60

Page 41: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mathematical Prerequisites

No formal pre-requisites

Balancing rigor and intuitionI no rigor for rigor’s sakeI we will tell you why you need the math, but also feel free to askI course focus on how to reason about statistics, not just

memorize guidelines

We will teach you any math you need as we go along

Crucially though- this class is not about innate statisticalaptitude, it is about effort

We all come from very different backgrounds. Please havepatience with yourself and with others.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 11 / 60

Page 42: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mathematical Prerequisites

No formal pre-requisites

Balancing rigor and intuitionI no rigor for rigor’s sakeI we will tell you why you need the math, but also feel free to askI course focus on how to reason about statistics, not just

memorize guidelines

We will teach you any math you need as we go along

Crucially though- this class is not about innate statisticalaptitude, it is about effort

We all come from very different backgrounds. Please havepatience with yourself and with others.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 11 / 60

Page 43: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mathematical Prerequisites

No formal pre-requisites

Balancing rigor and intuitionI no rigor for rigor’s sakeI we will tell you why you need the math, but also feel free to askI course focus on how to reason about statistics, not just

memorize guidelines

We will teach you any math you need as we go along

Crucially though- this class is not about innate statisticalaptitude, it is about effort

We all come from very different backgrounds. Please havepatience with yourself and with others.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 11 / 60

Page 44: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Mathematical Prerequisites

No formal pre-requisites

Balancing rigor and intuitionI no rigor for rigor’s sakeI we will tell you why you need the math, but also feel free to askI course focus on how to reason about statistics, not just

memorize guidelines

We will teach you any math you need as we go along

Crucially though- this class is not about innate statisticalaptitude, it is about effort

We all come from very different backgrounds. Please havepatience with yourself and with others.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 11 / 60

Page 45: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 12 / 60

Page 46: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 12 / 60

Page 47: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 12 / 60

Page 48: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 12 / 60

Page 49: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Reading

Think of the lecture slides as primary reading.

How to think about the reading?

Key Books:I Angrist and Pischke (2008) Mostly Harmless EconometricsI Aronow and Miller. Forthcoming. Foundations of Agnostic

Statistics (not yet available)I Blitzstein and Hwang. 2014. Introduction to Probability

(available online through the library)

Optional Books:I Fox (2016) Applied Regression Analysis and Generalized Linear

ModelsI Imai (2017) A First Course in Quantitative Social Science

Why so many books?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 13 / 60

Page 50: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Reading

Think of the lecture slides as primary reading.

How to think about the reading?

Key Books:I Angrist and Pischke (2008) Mostly Harmless EconometricsI Aronow and Miller. Forthcoming. Foundations of Agnostic

Statistics (not yet available)I Blitzstein and Hwang. 2014. Introduction to Probability

(available online through the library)

Optional Books:I Fox (2016) Applied Regression Analysis and Generalized Linear

ModelsI Imai (2017) A First Course in Quantitative Social Science

Why so many books?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 13 / 60

Page 51: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Reading

Think of the lecture slides as primary reading.

How to think about the reading?

Key Books:I Angrist and Pischke (2008) Mostly Harmless EconometricsI Aronow and Miller. Forthcoming. Foundations of Agnostic

Statistics (not yet available)I Blitzstein and Hwang. 2014. Introduction to Probability

(available online through the library)

Optional Books:I Fox (2016) Applied Regression Analysis and Generalized Linear

ModelsI Imai (2017) A First Course in Quantitative Social Science

Why so many books?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 13 / 60

Page 52: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Reading

Think of the lecture slides as primary reading.

How to think about the reading?

Key Books:I Angrist and Pischke (2008) Mostly Harmless EconometricsI Aronow and Miller. Forthcoming. Foundations of Agnostic

Statistics (not yet available)I Blitzstein and Hwang. 2014. Introduction to Probability

(available online through the library)

Optional Books:I Fox (2016) Applied Regression Analysis and Generalized Linear

ModelsI Imai (2017) A First Course in Quantitative Social Science

Why so many books?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 13 / 60

Page 53: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Reading

Think of the lecture slides as primary reading.

How to think about the reading?

Key Books:I Angrist and Pischke (2008) Mostly Harmless EconometricsI Aronow and Miller. Forthcoming. Foundations of Agnostic

Statistics (not yet available)I Blitzstein and Hwang. 2014. Introduction to Probability

(available online through the library)

Optional Books:I Fox (2016) Applied Regression Analysis and Generalized Linear

ModelsI Imai (2017) A First Course in Quantitative Social Science

Why so many books?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 13 / 60

Page 54: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Reading

Think of the lecture slides as primary reading.

How to think about the reading?

Key Books:I Angrist and Pischke (2008) Mostly Harmless EconometricsI Aronow and Miller. Forthcoming. Foundations of Agnostic

Statistics (not yet available)I Blitzstein and Hwang. 2014. Introduction to Probability

(available online through the library)

Optional Books:I Fox (2016) Applied Regression Analysis and Generalized Linear

ModelsI Imai (2017) A First Course in Quantitative Social Science

Why so many books?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 13 / 60

Page 55: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 14 / 60

Page 56: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 14 / 60

Page 57: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Problem Sets

Schedule (available Wednesday, due 8 days later at precept)

Grading and solutions

Collaboration policy

You may find these difficult. Start early and seek help!

Most important part of the class

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 15 / 60

Page 58: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Problem Sets

Schedule (available Wednesday, due 8 days later at precept)

Grading and solutions

Collaboration policy

You may find these difficult. Start early and seek help!

Most important part of the class

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 15 / 60

Page 59: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Problem Sets

Schedule (available Wednesday, due 8 days later at precept)

Grading and solutions

Collaboration policy

You may find these difficult. Start early and seek help!

Most important part of the class

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 15 / 60

Page 60: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Problem Sets

Schedule (available Wednesday, due 8 days later at precept)

Grading and solutions

Collaboration policy

You may find these difficult. Start early and seek help!

Most important part of the class

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 15 / 60

Page 61: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Problem Sets

Schedule (available Wednesday, due 8 days later at precept)

Grading and solutions

Collaboration policy

You may find these difficult. Start early and seek help!

Most important part of the class

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 15 / 60

Page 62: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Your Job: work hard and get help when you need it!

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 16 / 60

Page 63: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Your Job: work hard and get help when you need it!

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 16 / 60

Page 64: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Your Job: work hard and get help when you need it!

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 16 / 60

Page 65: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Ways to Learn

Lecturelearn broad topics

Preceptlearn data analysis skills, get targeted help on assignments

Readingssupport materials for lecture and precept

Problem Setsreinforce understanding of material, practice

Piazzaask questions of us and your classmates

Office Hoursask even more questions.

Your Job: work hard and get help when you need it!

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 16 / 60

Page 66: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Daily Feedback

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 17 / 60

Page 67: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Daily Feedback

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 17 / 60

Page 68: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Daily Feedback

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 17 / 60

Page 69: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

How to Get Help

1 Class and Precept

2 Daily Feedback

3 Readings and Slides

4 Piazza

5 Preceptor Office Hours

6 Instructor Office Hours

7 Final Exam Prep

8 External Consulting

9 Individual and Group Tutoring

Read the syllabus for more details.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 18 / 60

Page 70: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

How to Get Help

1 Class and Precept

2 Daily Feedback

3 Readings and Slides

4 Piazza

5 Preceptor Office Hours

6 Instructor Office Hours

7 Final Exam Prep

8 External Consulting

9 Individual and Group Tutoring

Read the syllabus for more details.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 18 / 60

Page 71: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

How to Get Help

1 Class and Precept

2 Daily Feedback

3 Readings and Slides

4 Piazza

5 Preceptor Office Hours

6 Instructor Office Hours

7 Final Exam Prep

8 External Consulting

9 Individual and Group Tutoring

Read the syllabus for more details.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 18 / 60

Page 72: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

How to Get Help

1 Class and Precept

2 Daily Feedback

3 Readings and Slides

4 Piazza

5 Preceptor Office Hours

6 Instructor Office Hours

7 Final Exam Prep

8 External Consulting

9 Individual and Group Tutoring

Read the syllabus for more details.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 18 / 60

Page 73: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

How to Get Help

1 Class and Precept

2 Daily Feedback

3 Readings and Slides

4 Piazza

5 Preceptor Office Hours

6 Instructor Office Hours

7 Final Exam Prep

8 External Consulting

9 Individual and Group Tutoring

Read the syllabus for more details.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 18 / 60

Page 74: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

How to Get Help

1 Class and Precept

2 Daily Feedback

3 Readings and Slides

4 Piazza

5 Preceptor Office Hours

6 Instructor Office Hours

7 Final Exam Prep

8 External Consulting

9 Individual and Group Tutoring

Read the syllabus for more details.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 18 / 60

Page 75: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Advice from Prior Generations

Definitely take it! And be prepared to set aside a lot of time.

Ask questions if you don’t know what’s going on!

Study hard, work hard, review the slides.

Investing a considerable amount of time in getting familiar withR and its various tools will pay off in the long run!

Go over the lecture slides each week. This can be hard when youfeel like you’re treading water and just staying afloat, but I wishI had done this regularly.

It’s challenging but very doable and rewarding if you put the timein. There are plenty of resources to take advantage of for help.

This course is very challenging but greatly contributed to myunderstanding of social statistics. If you’re truly invested in thesubject and willing to put in the work (more than you expectpossibly), it will be one of the best courses you’ve taken.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 19 / 60

Page 76: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Advice from Prior Generations

Definitely take it! And be prepared to set aside a lot of time.

Ask questions if you don’t know what’s going on!

Study hard, work hard, review the slides.

Investing a considerable amount of time in getting familiar withR and its various tools will pay off in the long run!

Go over the lecture slides each week. This can be hard when youfeel like you’re treading water and just staying afloat, but I wishI had done this regularly.

It’s challenging but very doable and rewarding if you put the timein. There are plenty of resources to take advantage of for help.

This course is very challenging but greatly contributed to myunderstanding of social statistics. If you’re truly invested in thesubject and willing to put in the work (more than you expectpossibly), it will be one of the best courses you’ve taken.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 19 / 60

Page 77: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Advice from Prior Generations

Definitely take it! And be prepared to set aside a lot of time.

Ask questions if you don’t know what’s going on!

Study hard, work hard, review the slides.

Investing a considerable amount of time in getting familiar withR and its various tools will pay off in the long run!

Go over the lecture slides each week. This can be hard when youfeel like you’re treading water and just staying afloat, but I wishI had done this regularly.

It’s challenging but very doable and rewarding if you put the timein. There are plenty of resources to take advantage of for help.

This course is very challenging but greatly contributed to myunderstanding of social statistics. If you’re truly invested in thesubject and willing to put in the work (more than you expectpossibly), it will be one of the best courses you’ve taken.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 19 / 60

Page 78: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Advice from Prior Generations

Definitely take it! And be prepared to set aside a lot of time.

Ask questions if you don’t know what’s going on!

Study hard, work hard, review the slides.

Investing a considerable amount of time in getting familiar withR and its various tools will pay off in the long run!

Go over the lecture slides each week. This can be hard when youfeel like you’re treading water and just staying afloat, but I wishI had done this regularly.

It’s challenging but very doable and rewarding if you put the timein. There are plenty of resources to take advantage of for help.

This course is very challenging but greatly contributed to myunderstanding of social statistics. If you’re truly invested in thesubject and willing to put in the work (more than you expectpossibly), it will be one of the best courses you’ve taken.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 19 / 60

Page 79: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Advice from Prior Generations

Definitely take it! And be prepared to set aside a lot of time.

Ask questions if you don’t know what’s going on!

Study hard, work hard, review the slides.

Investing a considerable amount of time in getting familiar withR and its various tools will pay off in the long run!

Go over the lecture slides each week. This can be hard when youfeel like you’re treading water and just staying afloat, but I wishI had done this regularly.

It’s challenging but very doable and rewarding if you put the timein. There are plenty of resources to take advantage of for help.

This course is very challenging but greatly contributed to myunderstanding of social statistics. If you’re truly invested in thesubject and willing to put in the work (more than you expectpossibly), it will be one of the best courses you’ve taken.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 19 / 60

Page 80: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Advice from Prior Generations

Definitely take it! And be prepared to set aside a lot of time.

Ask questions if you don’t know what’s going on!

Study hard, work hard, review the slides.

Investing a considerable amount of time in getting familiar withR and its various tools will pay off in the long run!

Go over the lecture slides each week. This can be hard when youfeel like you’re treading water and just staying afloat, but I wishI had done this regularly.

It’s challenging but very doable and rewarding if you put the timein. There are plenty of resources to take advantage of for help.

This course is very challenging but greatly contributed to myunderstanding of social statistics. If you’re truly invested in thesubject and willing to put in the work (more than you expectpossibly), it will be one of the best courses you’ve taken.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 19 / 60

Page 81: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Advice from Prior Generations

Definitely take it! And be prepared to set aside a lot of time.

Ask questions if you don’t know what’s going on!

Study hard, work hard, review the slides.

Investing a considerable amount of time in getting familiar withR and its various tools will pay off in the long run!

Go over the lecture slides each week. This can be hard when youfeel like you’re treading water and just staying afloat, but I wishI had done this regularly.

It’s challenging but very doable and rewarding if you put the timein. There are plenty of resources to take advantage of for help.

This course is very challenging but greatly contributed to myunderstanding of social statistics. If you’re truly invested in thesubject and willing to put in the work (more than you expectpossibly), it will be one of the best courses you’ve taken.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 19 / 60

Page 82: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Advice from Prior Generations

Definitely take it! And be prepared to set aside a lot of time.

Ask questions if you don’t know what’s going on!

Study hard, work hard, review the slides.

Investing a considerable amount of time in getting familiar withR and its various tools will pay off in the long run!

Go over the lecture slides each week. This can be hard when youfeel like you’re treading water and just staying afloat, but I wishI had done this regularly.

It’s challenging but very doable and rewarding if you put the timein. There are plenty of resources to take advantage of for help.

This course is very challenging but greatly contributed to myunderstanding of social statistics. If you’re truly invested in thesubject and willing to put in the work (more than you expectpossibly), it will be one of the best courses you’ve taken.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 19 / 60

Page 83: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Outline of Topics

Outline in reverse order:

Causal Inference:assess the effect of a counterfactual intervention using observedassociations.

Regression:measure the association (expectation of a variable given anumber of others).

Inference:learn about things we don’t know from the things we do know

Probability:learn what data we would expect if we did know the truth.

Probability → Inference → Regression → Causal Inference

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 20 / 60

Page 84: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Outline of Topics

Outline in reverse order:

Causal Inference:assess the effect of a counterfactual intervention using observedassociations.

Regression:measure the association (expectation of a variable given anumber of others).

Inference:learn about things we don’t know from the things we do know

Probability:learn what data we would expect if we did know the truth.

Probability → Inference → Regression → Causal Inference

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 20 / 60

Page 85: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Outline of Topics

Outline in reverse order:

Causal Inference:assess the effect of a counterfactual intervention using observedassociations.

Regression:measure the association (expectation of a variable given anumber of others).

Inference:learn about things we don’t know from the things we do know

Probability:learn what data we would expect if we did know the truth.

Probability → Inference → Regression → Causal Inference

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 20 / 60

Page 86: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Outline of Topics

Outline in reverse order:

Causal Inference:assess the effect of a counterfactual intervention using observedassociations.

Regression:measure the association (expectation of a variable given anumber of others).

Inference:learn about things we don’t know from the things we do know

Probability:learn what data we would expect if we did know the truth.

Probability → Inference → Regression → Causal Inference

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 20 / 60

Page 87: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Outline of Topics

Outline in reverse order:

Causal Inference:assess the effect of a counterfactual intervention using observedassociations.

Regression:measure the association (expectation of a variable given anumber of others).

Inference:learn about things we don’t know from the things we do know

Probability:learn what data we would expect if we did know the truth.

Probability → Inference → Regression → Causal Inference

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 20 / 60

Page 88: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Outline of Topics

Outline in reverse order:

Causal Inference:assess the effect of a counterfactual intervention using observedassociations.

Regression:measure the association (expectation of a variable given anumber of others).

Inference:learn about things we don’t know from the things we do know

Probability:learn what data we would expect if we did know the truth.

Probability → Inference → Regression → Causal Inference

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 20 / 60

Page 89: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Attribution and Thanks

My philosophy on teaching: don’t reinvent the wheel- customize,refine, improve.

Huge thanks to those who have provided slides particularly:Matt Blackwell, Adam Glynn, Justin Grimmer, Jens Hainmueller,Erin Hartman, Kevin Quinn

Also thanks to those who have discussed with me at lengthincluding Dalton Conley, Chad Hazlett, Gary King, Kosuke Imai,Matt Salganik and Teppei Yamamoto.

Previous generations of preceptors have also been incredibleimportant: Clark Bernier, Elisha Cohen, Ian Lundberg andSimone Zhang.

Shay O’Brien produced the hand-drawn illustrations usedthroughout.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 21 / 60

Page 90: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Attribution and Thanks

My philosophy on teaching: don’t reinvent the wheel- customize,refine, improve.

Huge thanks to those who have provided slides particularly:Matt Blackwell, Adam Glynn, Justin Grimmer, Jens Hainmueller,Erin Hartman, Kevin Quinn

Also thanks to those who have discussed with me at lengthincluding Dalton Conley, Chad Hazlett, Gary King, Kosuke Imai,Matt Salganik and Teppei Yamamoto.

Previous generations of preceptors have also been incredibleimportant: Clark Bernier, Elisha Cohen, Ian Lundberg andSimone Zhang.

Shay O’Brien produced the hand-drawn illustrations usedthroughout.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 21 / 60

Page 91: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Attribution and Thanks

My philosophy on teaching: don’t reinvent the wheel- customize,refine, improve.

Huge thanks to those who have provided slides particularly:Matt Blackwell, Adam Glynn, Justin Grimmer, Jens Hainmueller,Erin Hartman, Kevin Quinn

Also thanks to those who have discussed with me at lengthincluding Dalton Conley, Chad Hazlett, Gary King, Kosuke Imai,Matt Salganik and Teppei Yamamoto.

Previous generations of preceptors have also been incredibleimportant: Clark Bernier, Elisha Cohen, Ian Lundberg andSimone Zhang.

Shay O’Brien produced the hand-drawn illustrations usedthroughout.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 21 / 60

Page 92: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Attribution and Thanks

My philosophy on teaching: don’t reinvent the wheel- customize,refine, improve.

Huge thanks to those who have provided slides particularly:Matt Blackwell, Adam Glynn, Justin Grimmer, Jens Hainmueller,Erin Hartman, Kevin Quinn

Also thanks to those who have discussed with me at lengthincluding Dalton Conley, Chad Hazlett, Gary King, Kosuke Imai,Matt Salganik and Teppei Yamamoto.

Previous generations of preceptors have also been incredibleimportant: Clark Bernier, Elisha Cohen, Ian Lundberg andSimone Zhang.

Shay O’Brien produced the hand-drawn illustrations usedthroughout.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 21 / 60

Page 93: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Attribution and Thanks

My philosophy on teaching: don’t reinvent the wheel- customize,refine, improve.

Huge thanks to those who have provided slides particularly:Matt Blackwell, Adam Glynn, Justin Grimmer, Jens Hainmueller,Erin Hartman, Kevin Quinn

Also thanks to those who have discussed with me at lengthincluding Dalton Conley, Chad Hazlett, Gary King, Kosuke Imai,Matt Salganik and Teppei Yamamoto.

Previous generations of preceptors have also been incredibleimportant: Clark Bernier, Elisha Cohen, Ian Lundberg andSimone Zhang.

Shay O’Brien produced the hand-drawn illustrations usedthroughout.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 21 / 60

Page 94: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Attribution and Thanks

My philosophy on teaching: don’t reinvent the wheel- customize,refine, improve.

Huge thanks to those who have provided slides particularly:Matt Blackwell, Adam Glynn, Justin Grimmer, Jens Hainmueller,Erin Hartman, Kevin Quinn

Also thanks to those who have discussed with me at lengthincluding Dalton Conley, Chad Hazlett, Gary King, Kosuke Imai,Matt Salganik and Teppei Yamamoto.

Previous generations of preceptors have also been incredibleimportant: Clark Bernier, Elisha Cohen, Ian Lundberg andSimone Zhang.

Shay O’Brien produced the hand-drawn illustrations usedthroughout.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 21 / 60

Page 95: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

This Class

Any questions about this class?

Let’s get started

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 22 / 60

Page 96: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

This Class

Any questions about this class?

Let’s get started

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 22 / 60

Page 97: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

1 Welcome

2 Goals

3 Ways to Learn

4 Core Ideas

5 Introduction to ProbabilityWhat is Probability?Sample Spaces and EventsProbability FunctionsMarginal, Joint and Conditional ProbabilityBayes’ RuleIndependence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 23 / 60

Page 98: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

1 Welcome

2 Goals

3 Ways to Learn

4 Core Ideas

5 Introduction to ProbabilityWhat is Probability?Sample Spaces and EventsProbability FunctionsMarginal, Joint and Conditional ProbabilityBayes’ RuleIndependence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 23 / 60

Page 99: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

What is Statistics?

Branch of mathematics studying collection and analysis of data

The name statistic comes from the word state

The arc of developments in statistics

1) an applied scholar has a problem2) they solve the problem by inventing a specific method3) statisticians generalize and export the best of these methods

Relatively recent field (started at the very end of the 19thcentury)

Provides a way of making principled guesses based on statedassumptions.

In practice, an essential part of research, policy making, politicalcampaigns, selling people things. . .

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 24 / 60

Page 100: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

What is Statistics?

Branch of mathematics studying collection and analysis of data

The name statistic comes from the word state

The arc of developments in statistics

1) an applied scholar has a problem2) they solve the problem by inventing a specific method3) statisticians generalize and export the best of these methods

Relatively recent field (started at the very end of the 19thcentury)

Provides a way of making principled guesses based on statedassumptions.

In practice, an essential part of research, policy making, politicalcampaigns, selling people things. . .

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 24 / 60

Page 101: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

What is Statistics?

Branch of mathematics studying collection and analysis of data

The name statistic comes from the word state

The arc of developments in statistics

1) an applied scholar has a problem2) they solve the problem by inventing a specific method3) statisticians generalize and export the best of these methods

Relatively recent field (started at the very end of the 19thcentury)

Provides a way of making principled guesses based on statedassumptions.

In practice, an essential part of research, policy making, politicalcampaigns, selling people things. . .

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 24 / 60

Page 102: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

What is Statistics?

Branch of mathematics studying collection and analysis of data

The name statistic comes from the word state

The arc of developments in statistics

1) an applied scholar has a problem

2) they solve the problem by inventing a specific method3) statisticians generalize and export the best of these methods

Relatively recent field (started at the very end of the 19thcentury)

Provides a way of making principled guesses based on statedassumptions.

In practice, an essential part of research, policy making, politicalcampaigns, selling people things. . .

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 24 / 60

Page 103: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

What is Statistics?

Branch of mathematics studying collection and analysis of data

The name statistic comes from the word state

The arc of developments in statistics

1) an applied scholar has a problem2) they solve the problem by inventing a specific method

3) statisticians generalize and export the best of these methods

Relatively recent field (started at the very end of the 19thcentury)

Provides a way of making principled guesses based on statedassumptions.

In practice, an essential part of research, policy making, politicalcampaigns, selling people things. . .

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 24 / 60

Page 104: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

What is Statistics?

Branch of mathematics studying collection and analysis of data

The name statistic comes from the word state

The arc of developments in statistics

1) an applied scholar has a problem2) they solve the problem by inventing a specific method3) statisticians generalize and export the best of these methods

Relatively recent field (started at the very end of the 19thcentury)

Provides a way of making principled guesses based on statedassumptions.

In practice, an essential part of research, policy making, politicalcampaigns, selling people things. . .

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 24 / 60

Page 105: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

What is Statistics?

Branch of mathematics studying collection and analysis of data

The name statistic comes from the word state

The arc of developments in statistics

1) an applied scholar has a problem2) they solve the problem by inventing a specific method3) statisticians generalize and export the best of these methods

Relatively recent field (started at the very end of the 19thcentury)

Provides a way of making principled guesses based on statedassumptions.

In practice, an essential part of research, policy making, politicalcampaigns, selling people things. . .

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 24 / 60

Page 106: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

What is Statistics?

Branch of mathematics studying collection and analysis of data

The name statistic comes from the word state

The arc of developments in statistics

1) an applied scholar has a problem2) they solve the problem by inventing a specific method3) statisticians generalize and export the best of these methods

Relatively recent field (started at the very end of the 19thcentury)

Provides a way of making principled guesses based on statedassumptions.

In practice, an essential part of research, policy making, politicalcampaigns, selling people things. . .

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 24 / 60

Page 107: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

What is Statistics?

Branch of mathematics studying collection and analysis of data

The name statistic comes from the word state

The arc of developments in statistics

1) an applied scholar has a problem2) they solve the problem by inventing a specific method3) statisticians generalize and export the best of these methods

Relatively recent field (started at the very end of the 19thcentury)

Provides a way of making principled guesses based on statedassumptions.

In practice, an essential part of research, policy making, politicalcampaigns, selling people things. . .

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 24 / 60

Page 108: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why study probability?

It enables inference

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 25 / 60

Page 109: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why study probability?

It enables inference

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 25 / 60

Page 110: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

In Picture Form

Data generatingprocess

Probability

Inference

Observeddata

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 26 / 60

Page 111: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

In Picture Form

Data generatingprocess

Probability

Inference

Observeddata

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 26 / 60

Page 112: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

In Picture Form

Data generatingprocess

Probability

Inference

Observeddata

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 26 / 60

Page 113: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

In Picture Form

Datagenerating

processObserved data

probability

inference

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 27 / 60

Page 114: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Statistical Thought Experiments

Start with probability

Allows us to contemplate world under hypothetical scenariosI hypotheticals let us ask- is the observed relationship happening

by chance or is it systematic?I it tells us what the world would look like under a certain

assumption

We will review probability today, but feel free to ask questions asneeded throughout the semester.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 28 / 60

Page 115: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Statistical Thought Experiments

Start with probability

Allows us to contemplate world under hypothetical scenariosI hypotheticals let us ask- is the observed relationship happening

by chance or is it systematic?I it tells us what the world would look like under a certain

assumption

We will review probability today, but feel free to ask questions asneeded throughout the semester.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 28 / 60

Page 116: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Statistical Thought Experiments

Start with probability

Allows us to contemplate world under hypothetical scenarios

I hypotheticals let us ask- is the observed relationship happeningby chance or is it systematic?

I it tells us what the world would look like under a certainassumption

We will review probability today, but feel free to ask questions asneeded throughout the semester.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 28 / 60

Page 117: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Statistical Thought Experiments

Start with probability

Allows us to contemplate world under hypothetical scenariosI hypotheticals let us ask- is the observed relationship happening

by chance or is it systematic?

I it tells us what the world would look like under a certainassumption

We will review probability today, but feel free to ask questions asneeded throughout the semester.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 28 / 60

Page 118: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Statistical Thought Experiments

Start with probability

Allows us to contemplate world under hypothetical scenariosI hypotheticals let us ask- is the observed relationship happening

by chance or is it systematic?I it tells us what the world would look like under a certain

assumption

We will review probability today, but feel free to ask questions asneeded throughout the semester.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 28 / 60

Page 119: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Statistical Thought Experiments

Start with probability

Allows us to contemplate world under hypothetical scenariosI hypotheticals let us ask- is the observed relationship happening

by chance or is it systematic?I it tells us what the world would look like under a certain

assumption

We will review probability today, but feel free to ask questions asneeded throughout the semester.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 28 / 60

Page 120: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup(lady discerning about tea)

The Experiment(perform a taste test)

The Hypothetical(count possibilities)

The Result(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 121: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup

(lady discerning about tea)

The Experiment(perform a taste test)

The Hypothetical(count possibilities)

The Result(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 122: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup(lady discerning about tea)

The Experiment(perform a taste test)

The Hypothetical(count possibilities)

The Result(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 123: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup(lady discerning about tea)

The Experiment

(perform a taste test)

The Hypothetical(count possibilities)

The Result(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 124: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup(lady discerning about tea)

The Experiment(perform a taste test)

The Hypothetical(count possibilities)

The Result(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 125: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup(lady discerning about tea)

The Experiment(perform a taste test)

The Hypothetical

(count possibilities)

The Result(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 126: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup(lady discerning about tea)

The Experiment(perform a taste test)

The Hypothetical(count possibilities)

The Result(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 127: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup(lady discerning about tea)

The Experiment(perform a taste test)

The Hypothetical(count possibilities)

The Result

(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 128: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup(lady discerning about tea)

The Experiment(perform a taste test)

The Hypothetical(count possibilities)

The Result(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 129: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Fisher’s Lady Tasting Tea

The Story Setup(lady discerning about tea)

The Experiment(perform a taste test)

The Hypothetical(count possibilities)

The Result(boom she was right)

This became the Fisher Exact Test.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 29 / 60

Page 130: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

1 Welcome

2 Goals

3 Ways to Learn

4 Core Ideas

5 Introduction to ProbabilityWhat is Probability?Sample Spaces and EventsProbability FunctionsMarginal, Joint and Conditional ProbabilityBayes’ RuleIndependence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 30 / 60

Page 131: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

1 Welcome

2 Goals

3 Ways to Learn

4 Core Ideas

5 Introduction to ProbabilityWhat is Probability?Sample Spaces and EventsProbability FunctionsMarginal, Joint and Conditional ProbabilityBayes’ RuleIndependence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 30 / 60

Page 132: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

From ‘Probably’ to Probability

Can we make this more precise?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 31 / 60

Page 133: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why Probability?

Helps us envision hypotheticals

Describes uncertainty in how the data is generated

Data Analysis: estimate probability that something will happen

Thus: we need to know how probability gives rise to data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 32 / 60

Page 134: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why Probability?

Helps us envision hypotheticals

Describes uncertainty in how the data is generated

Data Analysis: estimate probability that something will happen

Thus: we need to know how probability gives rise to data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 32 / 60

Page 135: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why Probability?

Helps us envision hypotheticals

Describes uncertainty in how the data is generated

Data Analysis: estimate probability that something will happen

Thus: we need to know how probability gives rise to data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 32 / 60

Page 136: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why Probability?

Helps us envision hypotheticals

Describes uncertainty in how the data is generated

Data Analysis: estimate probability that something will happen

Thus: we need to know how probability gives rise to data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 32 / 60

Page 137: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Why Probability?

Helps us envision hypotheticals

Describes uncertainty in how the data is generated

Data Analysis: estimate probability that something will happen

Thus: we need to know how probability gives rise to data

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 32 / 60

Page 138: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Intuitive Definition of Probability

While there are several interpretations of what probability is, mostmodern (post 1935 or so) researchers agree on an axiomaticdefinition of probability.

3 Axioms (Intuitive Version):

1 The probability of any particular event must be non-negative.

2 The probability of anything occurring among all possible eventsmust be 1.

3 The probability of one of many mutually exclusive eventshappening is the sum of the individual probabilities.

All the rules of probability can be derived from these axioms.(we will return to these in a minute)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 33 / 60

Page 139: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Intuitive Definition of Probability

While there are several interpretations of what probability is, mostmodern (post 1935 or so) researchers agree on an axiomaticdefinition of probability.

3 Axioms (Intuitive Version):

1 The probability of any particular event must be non-negative.

2 The probability of anything occurring among all possible eventsmust be 1.

3 The probability of one of many mutually exclusive eventshappening is the sum of the individual probabilities.

All the rules of probability can be derived from these axioms.(we will return to these in a minute)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 33 / 60

Page 140: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Intuitive Definition of Probability

While there are several interpretations of what probability is, mostmodern (post 1935 or so) researchers agree on an axiomaticdefinition of probability.

3 Axioms (Intuitive Version):

1 The probability of any particular event must be non-negative.

2 The probability of anything occurring among all possible eventsmust be 1.

3 The probability of one of many mutually exclusive eventshappening is the sum of the individual probabilities.

All the rules of probability can be derived from these axioms.(we will return to these in a minute)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 33 / 60

Page 141: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Intuitive Definition of Probability

While there are several interpretations of what probability is, mostmodern (post 1935 or so) researchers agree on an axiomaticdefinition of probability.

3 Axioms (Intuitive Version):

1 The probability of any particular event must be non-negative.

2 The probability of anything occurring among all possible eventsmust be 1.

3 The probability of one of many mutually exclusive eventshappening is the sum of the individual probabilities.

All the rules of probability can be derived from these axioms.(we will return to these in a minute)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 33 / 60

Page 142: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Intuitive Definition of Probability

While there are several interpretations of what probability is, mostmodern (post 1935 or so) researchers agree on an axiomaticdefinition of probability.

3 Axioms (Intuitive Version):

1 The probability of any particular event must be non-negative.

2 The probability of anything occurring among all possible eventsmust be 1.

3 The probability of one of many mutually exclusive eventshappening is the sum of the individual probabilities.

All the rules of probability can be derived from these axioms.(we will return to these in a minute)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 33 / 60

Page 143: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Intuitive Definition of Probability

While there are several interpretations of what probability is, mostmodern (post 1935 or so) researchers agree on an axiomaticdefinition of probability.

3 Axioms (Intuitive Version):

1 The probability of any particular event must be non-negative.

2 The probability of anything occurring among all possible eventsmust be 1.

3 The probability of one of many mutually exclusive eventshappening is the sum of the individual probabilities.

All the rules of probability can be derived from these axioms.(we will return to these in a minute)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 33 / 60

Page 144: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Intuitive Definition of Probability

While there are several interpretations of what probability is, mostmodern (post 1935 or so) researchers agree on an axiomaticdefinition of probability.

3 Axioms (Intuitive Version):

1 The probability of any particular event must be non-negative.

2 The probability of anything occurring among all possible eventsmust be 1.

3 The probability of one of many mutually exclusive eventshappening is the sum of the individual probabilities.

All the rules of probability can be derived from these axioms.(we will return to these in a minute)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 33 / 60

Page 145: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads, heads, tails, tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 146: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads, heads, tails, tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 147: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads, heads, tails, tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 148: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads, heads, tails, tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 149: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =

heads, heads, heads, tails, tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 150: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads,

heads, tails, tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 151: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads, heads, tails,

tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 152: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads, heads, tails, tails, heads,

tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 153: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads, heads, tails, tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 154: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads, heads, tails, tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.

(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 155: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Sample Spaces

To define probability we need to define the set of possible outcomes.

The sample space is the set of all possible outcomes, and is oftenwritten as S or Ω.

For example, if we flip a coin twice, there are four possible outcomes,

S =heads, heads, heads, tails, tails, heads, tails, tails

Thus the table in Lady Tasting Tea was defining the sample space.(Note we defined illogical guesses to be prob= 0)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 34 / 60

Page 156: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Running Visual Metaphor

Imagine that we sample an apple from a bag.Looking in the bag we see:

The sample space is:

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 35 / 60

Page 157: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Running Visual Metaphor

Imagine that we sample an apple from a bag.

Looking in the bag we see:

The sample space is:

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 35 / 60

Page 158: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Running Visual Metaphor

Imagine that we sample an apple from a bag.Looking in the bag we see:

The sample space is:

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 35 / 60

Page 159: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Running Visual Metaphor

Imagine that we sample an apple from a bag.Looking in the bag we see:

The sample space is:

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 35 / 60

Page 160: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

EventsEvents are subsets of the sample space.

For Example, if

then

, ,

and

are both events.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 36 / 60

Page 161: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

EventsEvents are subsets of the sample space.

For Example, if

then

, ,

and

are both events.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 36 / 60

Page 162: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

EventsEvents are subsets of the sample space.

For Example, if

then

, ,

and

are both events.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 36 / 60

Page 163: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Events Are a Kind of Set

Sets are collections of things, in this case collections of outcomes

One way to define an event is to describe the common property thatall of the outcomes share. We write this as

ω|ω satisfies P,

where P is the property that they all share.

If A = ω|ω has a leaf :

A,A, A, A

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 37 / 60

Page 164: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Events Are a Kind of SetSets are collections of things, in this case collections of outcomes

One way to define an event is to describe the common property thatall of the outcomes share. We write this as

ω|ω satisfies P,

where P is the property that they all share.

If A = ω|ω has a leaf :

A,A, A, A

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 37 / 60

Page 165: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Events Are a Kind of SetSets are collections of things, in this case collections of outcomes

One way to define an event is to describe the common property thatall of the outcomes share. We write this as

ω|ω satisfies P,

where P is the property that they all share.

If A = ω|ω has a leaf :

A,A, A, A

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 37 / 60

Page 166: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Events Are a Kind of SetSets are collections of things, in this case collections of outcomes

One way to define an event is to describe the common property thatall of the outcomes share. We write this as

ω|ω satisfies P,

where P is the property that they all share.

If A = ω|ω has a leaf :

A,A, A, A

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 37 / 60

Page 167: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

ComplementA complement of event A is a set: Ac , is collection of all of theoutcomes not in A. That is, it is “everything else” in the samplespace.

, ,

and

are complements.

Ac = ω ∈ Ω|ω /∈ A.

Important complement: Ωc = ∅, where ∅ is the empty set—it’s justthe event that nothing happens.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 38 / 60

Page 168: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

ComplementA complement of event A is a set: Ac , is collection of all of theoutcomes not in A. That is, it is “everything else” in the samplespace.

, ,

and

are complements.

Ac = ω ∈ Ω|ω /∈ A.

Important complement: Ωc = ∅, where ∅ is the empty set—it’s justthe event that nothing happens.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 38 / 60

Page 169: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

ComplementA complement of event A is a set: Ac , is collection of all of theoutcomes not in A. That is, it is “everything else” in the samplespace.

, ,

and

are complements.

Ac = ω ∈ Ω|ω /∈ A.

Important complement: Ωc = ∅, where ∅ is the empty set—it’s justthe event that nothing happens.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 38 / 60

Page 170: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Unions and intersections (Operations on events)

The union of two events, A and B is the event that A or B occurs:

=

, ,A ∪ B = ω|ω ∈ A or ω ∈ B.

The intersection of two events, A and B is the event that both A andB occur:

=

A ∩ B = ω|ω ∈ A and ω ∈ B.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 39 / 60

Page 171: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Unions and intersections (Operations on events)

The union of two events, A and B is the event that A or B occurs:

=

, ,A ∪ B = ω|ω ∈ A or ω ∈ B.

The intersection of two events, A and B is the event that both A andB occur:

=

A ∩ B = ω|ω ∈ A and ω ∈ B.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 39 / 60

Page 172: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Unions and intersections (Operations on events)

The union of two events, A and B is the event that A or B occurs:

=

, ,A ∪ B = ω|ω ∈ A or ω ∈ B.

The intersection of two events, A and B is the event that both A andB occur:

=

A ∩ B = ω|ω ∈ A and ω ∈ B.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 39 / 60

Page 173: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Operations on Events

We say that two events A and B are disjoint or mutually exclusive ifthey don’t share any elements or that A ∩ B = ∅.

An event and its complement A and Ac are disjoint.

Sample spaces can have infinite events A1,A2, . . ..

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 40 / 60

Page 174: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Operations on Events

We say that two events A and B are disjoint or mutually exclusive ifthey don’t share any elements or that A ∩ B = ∅.

An event and its complement A and Ac are disjoint.

Sample spaces can have infinite events A1,A2, . . ..

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 40 / 60

Page 175: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Operations on Events

We say that two events A and B are disjoint or mutually exclusive ifthey don’t share any elements or that A ∩ B = ∅.

An event and its complement A and Ac are disjoint.

Sample spaces can have infinite events A1,A2, . . ..

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 40 / 60

Page 176: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Operations on Events

We say that two events A and B are disjoint or mutually exclusive ifthey don’t share any elements or that A ∩ B = ∅.

An event and its complement A and Ac are disjoint.

Sample spaces can have infinite events A1,A2, . . ..

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 40 / 60

Page 177: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Probability FunctionA probability function P(·) is a function defined over all subsets of asample space S that satisfies the following three axioms:

1 P(A) ≥ 0 for all A in the setof all events.

nonnegativity

2 P(S) = 1

normalization

3 if events A1,A2, . . . aremutually exclusive thenP(⋃∞

i=1 Ai) =∑∞

i=1 P(Ai).

additivity

All the rules of probability can be derived from these axioms.(See Blitzstein & Hwang, Def 1.6.1.)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 41 / 60

Page 178: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Probability FunctionA probability function P(·) is a function defined over all subsets of asample space S that satisfies the following three axioms:

1 P(A) ≥ 0 for all A in the setof all events.

nonnegativity

2 P(S) = 1

normalization

3 if events A1,A2, . . . aremutually exclusive thenP(⋃∞

i=1 Ai) =∑∞

i=1 P(Ai).

additivity

1. P( ) = -.5

All the rules of probability can be derived from these axioms.(See Blitzstein & Hwang, Def 1.6.1.)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 41 / 60

Page 179: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Probability FunctionA probability function P(·) is a function defined over all subsets of asample space S that satisfies the following three axioms:

1 P(A) ≥ 0 for all A in the setof all events. nonnegativity

2 P(S) = 1

normalization

3 if events A1,A2, . . . aremutually exclusive thenP(⋃∞

i=1 Ai) =∑∞

i=1 P(Ai).

additivity

1. P( ) = -.5

All the rules of probability can be derived from these axioms.(See Blitzstein & Hwang, Def 1.6.1.)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 41 / 60

Page 180: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Probability FunctionA probability function P(·) is a function defined over all subsets of asample space S that satisfies the following three axioms:

1 P(A) ≥ 0 for all A in the setof all events. nonnegativity

2 P(S) = 1

normalization

3 if events A1,A2, . . . aremutually exclusive thenP(⋃∞

i=1 Ai) =∑∞

i=1 P(Ai).

additivity

1. P( ) = -.5

2. P( ) = , , , 1

All the rules of probability can be derived from these axioms.(See Blitzstein & Hwang, Def 1.6.1.)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 41 / 60

Page 181: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Probability FunctionA probability function P(·) is a function defined over all subsets of asample space S that satisfies the following three axioms:

1 P(A) ≥ 0 for all A in the setof all events. nonnegativity

2 P(S) = 1 normalization

3 if events A1,A2, . . . aremutually exclusive thenP(⋃∞

i=1 Ai) =∑∞

i=1 P(Ai).

additivity

1. P( ) = -.5

2. P( ) = , , , 1

All the rules of probability can be derived from these axioms.(See Blitzstein & Hwang, Def 1.6.1.)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 41 / 60

Page 182: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Probability FunctionA probability function P(·) is a function defined over all subsets of asample space S that satisfies the following three axioms:

1 P(A) ≥ 0 for all A in the setof all events. nonnegativity

2 P(S) = 1 normalization

3 if events A1,A2, . . . aremutually exclusive thenP(⋃∞

i=1 Ai) =∑∞

i=1 P(Ai).

additivity

1. P( ) = -.5

2. P( ) = , , , 1

3. P( ) = P( ) P( )+when and aremutually exclusive.

All the rules of probability can be derived from these axioms.(See Blitzstein & Hwang, Def 1.6.1.)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 41 / 60

Page 183: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Probability FunctionA probability function P(·) is a function defined over all subsets of asample space S that satisfies the following three axioms:

1 P(A) ≥ 0 for all A in the setof all events. nonnegativity

2 P(S) = 1 normalization

3 if events A1,A2, . . . aremutually exclusive thenP(⋃∞

i=1 Ai) =∑∞

i=1 P(Ai).additivity

1. P( ) = -.5

2. P( ) = , , , 1

3. P( ) = P( ) P( )+when and aremutually exclusive.

All the rules of probability can be derived from these axioms.(See Blitzstein & Hwang, Def 1.6.1.)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 41 / 60

Page 184: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Probability FunctionA probability function P(·) is a function defined over all subsets of asample space S that satisfies the following three axioms:

1 P(A) ≥ 0 for all A in the setof all events. nonnegativity

2 P(S) = 1 normalization

3 if events A1,A2, . . . aremutually exclusive thenP(⋃∞

i=1 Ai) =∑∞

i=1 P(Ai).additivity

1. P( ) = -.5

2. P( ) = , , , 1

3. P( ) = P( ) P( )+when and aremutually exclusive.

All the rules of probability can be derived from these axioms.(See Blitzstein & Hwang, Def 1.6.1.)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 41 / 60

Page 185: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective InterpretationI Example: The probability of drawing 5 red cards out of 10

drawn from a deck of cards is whatever you want it to be. But...I If you don’t follow the axioms, a bookie can beat youI There is a correct way to update your beliefs with data.

Frequency InterpretationI Probability is the relative frequency with which an event would

occur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 186: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective InterpretationI Example: The probability of drawing 5 red cards out of 10

drawn from a deck of cards is whatever you want it to be. But...I If you don’t follow the axioms, a bookie can beat youI There is a correct way to update your beliefs with data.

Frequency InterpretationI Probability is the relative frequency with which an event would

occur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 187: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective Interpretation

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is whatever you want it to be. But...

I If you don’t follow the axioms, a bookie can beat youI There is a correct way to update your beliefs with data.

Frequency InterpretationI Probability is the relative frequency with which an event would

occur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 188: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective InterpretationI Example: The probability of drawing 5 red cards out of 10

drawn from a deck of cards is whatever you want it to be.

But...I If you don’t follow the axioms, a bookie can beat youI There is a correct way to update your beliefs with data.

Frequency InterpretationI Probability is the relative frequency with which an event would

occur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 189: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective InterpretationI Example: The probability of drawing 5 red cards out of 10

drawn from a deck of cards is whatever you want it to be. But...

I If you don’t follow the axioms, a bookie can beat youI There is a correct way to update your beliefs with data.

Frequency InterpretationI Probability is the relative frequency with which an event would

occur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 190: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective InterpretationI Example: The probability of drawing 5 red cards out of 10

drawn from a deck of cards is whatever you want it to be. But...I If you don’t follow the axioms, a bookie can beat you

I There is a correct way to update your beliefs with data.

Frequency InterpretationI Probability is the relative frequency with which an event would

occur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 191: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective InterpretationI Example: The probability of drawing 5 red cards out of 10

drawn from a deck of cards is whatever you want it to be. But...I If you don’t follow the axioms, a bookie can beat youI There is a correct way to update your beliefs with data.

Frequency InterpretationI Probability is the relative frequency with which an event would

occur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 192: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective InterpretationI Example: The probability of drawing 5 red cards out of 10

drawn from a deck of cards is whatever you want it to be. But...I If you don’t follow the axioms, a bookie can beat youI There is a correct way to update your beliefs with data.

Frequency Interpretation

I Probability is the relative frequency with which an event wouldoccur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 193: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective InterpretationI Example: The probability of drawing 5 red cards out of 10

drawn from a deck of cards is whatever you want it to be. But...I If you don’t follow the axioms, a bookie can beat youI There is a correct way to update your beliefs with data.

Frequency InterpretationI Probability is the relative frequency with which an event would

occur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 194: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Brief Word on Interpretation

Massive debate on interpretation:

Subjective InterpretationI Example: The probability of drawing 5 red cards out of 10

drawn from a deck of cards is whatever you want it to be. But...I If you don’t follow the axioms, a bookie can beat youI There is a correct way to update your beliefs with data.

Frequency InterpretationI Probability is the relative frequency with which an event would

occur if the process were repeated a large number of timesunder similar conditions.

I Example: The probability of drawing 5 red cards out of 10drawn from a deck of cards is the frequency with which thisevent occurs in repeated samples of 10 cards.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 42 / 60

Page 195: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Three Big Ideas

Marginal, joint, and conditional probabilities

Bayes’ rule

Independence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 43 / 60

Page 196: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Three Big Ideas

Marginal, joint, and conditional probabilities

Bayes’ rule

Independence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 43 / 60

Page 197: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Three Big Ideas

Marginal, joint, and conditional probabilities

Bayes’ rule

Independence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 43 / 60

Page 198: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Three Big Ideas

Marginal, joint, and conditional probabilities

Bayes’ rule

Independence

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 43 / 60

Page 199: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Marginal and Joint Probability

So far we have only considered situations where we are interested inthe probability of a single event A occurring. We’ve denoted thisP(A). P(A) is sometimes called a marginal probability.

Suppose we are now in a situation where we would like to express theprobability that an event A and an event B occur. This quantity iswritten as P(A ∩ B), P(B ∩ A), P(A,B), or P(B ,A) and is the jointprobability of A and B .

P( ), = P( ) P( )=

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 44 / 60

Page 200: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Marginal and Joint Probability

So far we have only considered situations where we are interested inthe probability of a single event A occurring. We’ve denoted thisP(A). P(A) is sometimes called a marginal probability.

Suppose we are now in a situation where we would like to express theprobability that an event A and an event B occur. This quantity iswritten as P(A ∩ B), P(B ∩ A), P(A,B), or P(B ,A) and is the jointprobability of A and B .

P( ), = P( ) P( )=

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 44 / 60

Page 201: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

P( ), = ?

P( ) = ?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 45 / 60

Page 202: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Conditional Probability

The “soul of statistics”If P(A) > 0 then the probability of B conditional on A can be writtenas

P(B |A) =P(A,B)

P(A)

This implies that

P(A,B) = P(A)× P(B |A)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 46 / 60

Page 203: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Conditional Probability

The “soul of statistics”

If P(A) > 0 then the probability of B conditional on A can be writtenas

P(B |A) =P(A,B)

P(A)

This implies that

P(A,B) = P(A)× P(B |A)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 46 / 60

Page 204: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Conditional Probability

The “soul of statistics”If P(A) > 0 then the probability of B conditional on A can be writtenas

P(B |A) =P(A,B)

P(A)

This implies that

P(A,B) = P(A)× P(B |A)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 46 / 60

Page 205: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Conditional Probability

The “soul of statistics”If P(A) > 0 then the probability of B conditional on A can be writtenas

P(B |A) =P(A,B)

P(A)

This implies that

P(A,B) = P(A)× P(B |A)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 46 / 60

Page 206: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Conditional Probability

The “soul of statistics”If P(A) > 0 then the probability of B conditional on A can be writtenas

P(B |A) =P(A,B)

P(A)

This implies that

P(A,B) = P(A)× P(B |A)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 46 / 60

Page 207: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Conditional Probability

The “soul of statistics”If P(A) > 0 then the probability of B conditional on A can be writtenas

P(B |A) =P(A,B)

P(A)

This implies that

P(A,B) = P(A)× P(B |A)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 46 / 60

Page 208: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Conditional Probability: A Visual Example

P( )| =P( ),

P( )

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 47 / 60

Page 209: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Conditional Probability: A Visual Example

P( )| =P( ),

P( )

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 47 / 60

Page 210: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Conditional Probability: A Visual Example

P( )| =P( ),

P( )

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 47 / 60

Page 211: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Card Player’s Example

If we randomly draw two cards from a standard 52 card deck anddefine the eventsA = King on Draw 1 and B = King on Draw 2, then

P(A) = 4/52

P(B |A) = 3/51

P(A,B) = P(A)× P(B |A) = 4/52× 3/51 ≈ .0045

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 48 / 60

Page 212: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Card Player’s Example

If we randomly draw two cards from a standard 52 card deck anddefine the eventsA = King on Draw 1 and B = King on Draw 2, then

P(A) =

4/52

P(B |A) = 3/51

P(A,B) = P(A)× P(B |A) = 4/52× 3/51 ≈ .0045

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 48 / 60

Page 213: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Card Player’s Example

If we randomly draw two cards from a standard 52 card deck anddefine the eventsA = King on Draw 1 and B = King on Draw 2, then

P(A) = 4/52

P(B |A) = 3/51

P(A,B) = P(A)× P(B |A) = 4/52× 3/51 ≈ .0045

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 48 / 60

Page 214: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Card Player’s Example

If we randomly draw two cards from a standard 52 card deck anddefine the eventsA = King on Draw 1 and B = King on Draw 2, then

P(A) = 4/52

P(B |A) =

3/51

P(A,B) = P(A)× P(B |A) = 4/52× 3/51 ≈ .0045

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 48 / 60

Page 215: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Card Player’s Example

If we randomly draw two cards from a standard 52 card deck anddefine the eventsA = King on Draw 1 and B = King on Draw 2, then

P(A) = 4/52

P(B |A) = 3/51

P(A,B) = P(A)× P(B |A) = 4/52× 3/51 ≈ .0045

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 48 / 60

Page 216: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Card Player’s Example

If we randomly draw two cards from a standard 52 card deck anddefine the eventsA = King on Draw 1 and B = King on Draw 2, then

P(A) = 4/52

P(B |A) = 3/51

P(A,B) = P(A)× P(B |A) =

4/52× 3/51 ≈ .0045

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 48 / 60

Page 217: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

A Card Player’s Example

If we randomly draw two cards from a standard 52 card deck anddefine the eventsA = King on Draw 1 and B = King on Draw 2, then

P(A) = 4/52

P(B |A) = 3/51

P(A,B) = P(A)× P(B |A) = 4/52× 3/51 ≈ .0045

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 48 / 60

Page 218: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Law of Total Probability (LTP)

With 2 Events:

P(B) = P(B ,A) + P(B ,Ac)

= P(B |A)× P(A) + P(B |Ac)× P(Ac)

= P( ) P( )+P( )

= P( )| x P( ) + P( )| x P( )

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 49 / 60

Page 219: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Recall, if we randomly draw two cards from a standard 52 card deckand define the events A = King on Draw 1 andB = King on Draw 2, then

P(A) = 4/52

P(B |A) = 3/51

P(A,B) = P(A)× P(B |A) = 4/52× 3/51

Question: P(B) =?

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 50 / 60

Page 220: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Confirming Intuition with the LTP

P(B) = P(B ,A) + P(B ,Ac)

= P(B |A)× P(A) + P(B |Ac)× P(Ac)

P(B) = 3/51× 1/13 + 4/51× 12/13

=3 + 48

51× 13=

1

13=

4

52

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 51 / 60

Page 221: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Confirming Intuition with the LTP

P(B) = P(B ,A) + P(B ,Ac)

= P(B |A)× P(A) + P(B |Ac)× P(Ac)

P(B) = 3/51× 1/13 + 4/51× 12/13

=3 + 48

51× 13=

1

13=

4

52

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 51 / 60

Page 222: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Confirming Intuition with the LTP

P(B) = P(B ,A) + P(B ,Ac)

= P(B |A)× P(A) + P(B |Ac)× P(Ac)

P(B) = 3/51× 1/13 + 4/51× 12/13

=3 + 48

51× 13=

1

13=

4

52

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 51 / 60

Page 223: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote].

We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not. Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 224: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote]. We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not. Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 225: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote]. We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not. Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 226: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote]. We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not. Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 227: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote]. We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not. Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 228: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote]. We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not.

Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 229: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote]. We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not. Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 230: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote]. We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not. Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 231: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote]. We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not. Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 232: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Voter MobilizationSuppose that we have put together a voter mobilization campaignand we want to know what the probability of voting is after thecampaign: Pr[vote]. We know the following:

Pr(vote|mobilized) = 0.75

Pr(vote|not mobilized) = 0.15

Pr(mobilized) = 0.6 and so Pr( not mobilized) = 0.4

Note that mobilization partitions the data. Everyone is eithermobilized or not. Thus, we can apply the LTP:

Pr(vote) = Pr(vote|mobilized) Pr(mobilized)+

Pr(vote|not mobilized) Pr(not mobilized)

=0.75× 0.6 + 0.15× 0.4

=.51

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 52 / 60

Page 233: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule

Often we have information about Pr(B |A), but require Pr(A|B)instead.

When this happens, always think: Bayes’ rule

Bayes’ rule: if Pr(B) > 0, then:

Pr(A|B) =Pr(B |A) Pr(A)

Pr(B)

Proof: combine the multiplication rulePr(B |A) Pr(A) = P(A ∩ B), and the definition of conditionalprobability

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 53 / 60

Page 234: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule

Often we have information about Pr(B |A), but require Pr(A|B)instead.

When this happens, always think: Bayes’ rule

Bayes’ rule: if Pr(B) > 0, then:

Pr(A|B) =Pr(B |A) Pr(A)

Pr(B)

Proof: combine the multiplication rulePr(B |A) Pr(A) = P(A ∩ B), and the definition of conditionalprobability

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 53 / 60

Page 235: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule

Often we have information about Pr(B |A), but require Pr(A|B)instead.

When this happens, always think: Bayes’ rule

Bayes’ rule: if Pr(B) > 0, then:

Pr(A|B) =Pr(B |A) Pr(A)

Pr(B)

Proof: combine the multiplication rulePr(B |A) Pr(A) = P(A ∩ B), and the definition of conditionalprobability

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 53 / 60

Page 236: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule

Often we have information about Pr(B |A), but require Pr(A|B)instead.

When this happens, always think: Bayes’ rule

Bayes’ rule: if Pr(B) > 0, then:

Pr(A|B) =Pr(B |A) Pr(A)

Pr(B)

Proof: combine the multiplication rulePr(B |A) Pr(A) = P(A ∩ B), and the definition of conditionalprobability

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 53 / 60

Page 237: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule

Often we have information about Pr(B |A), but require Pr(A|B)instead.

When this happens, always think: Bayes’ rule

Bayes’ rule: if Pr(B) > 0, then:

Pr(A|B) =Pr(B |A) Pr(A)

Pr(B)

Proof: combine the multiplication rulePr(B |A) Pr(A) = P(A ∩ B), and the definition of conditionalprobability

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 53 / 60

Page 238: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule Mechanics

P( )| =P(

P( ))| P( )

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 54 / 60

Page 239: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule Mechanics

P( )| =P(

P( ))| P( )

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 54 / 60

Page 240: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule Mechanics

P( )| =P(

P( ))| P( )

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 54 / 60

Page 241: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule Mechanics

P( )| =P(

P( ))| P( )

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 54 / 60

Page 242: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule Example

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 55 / 60

Page 243: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Bayes’ Rule Example

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 55 / 60

Page 244: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132I Pr(not AfAm) = 1− Pr(AfAm) = .868I Pr(Washington|AfAm) = 0.00378I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 245: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132I Pr(not AfAm) = 1− Pr(AfAm) = .868I Pr(Washington|AfAm) = 0.00378I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 246: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132I Pr(not AfAm) = 1− Pr(AfAm) = .868I Pr(Washington|AfAm) = 0.00378I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 247: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132I Pr(not AfAm) = 1− Pr(AfAm) = .868I Pr(Washington|AfAm) = 0.00378I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 248: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132

I Pr(not AfAm) = 1− Pr(AfAm) = .868I Pr(Washington|AfAm) = 0.00378I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 249: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132I Pr(not AfAm) = 1− Pr(AfAm) = .868

I Pr(Washington|AfAm) = 0.00378I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 250: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132I Pr(not AfAm) = 1− Pr(AfAm) = .868I Pr(Washington|AfAm) = 0.00378

I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 251: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132I Pr(not AfAm) = 1− Pr(AfAm) = .868I Pr(Washington|AfAm) = 0.00378I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 252: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132I Pr(not AfAm) = 1− Pr(AfAm) = .868I Pr(Washington|AfAm) = 0.00378I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 253: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Enos (2015): how do we identify a person’s race from theirname?

First, note that the Census collects information on thedistribution of names by race.

For example, Washington is the most common last name amongAfrican-Americans in America:

I Pr(AfAm) = 0.132I Pr(not AfAm) = 1− Pr(AfAm) = .868I Pr(Washington|AfAm) = 0.00378I Pr(Washington|not AfAm) = 0.000061

We can now use Bayes’ Rule

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 56 / 60

Page 254: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Note we don’t have the probability of the name Washington.

Remember that we can calculate it from the LTP since the setsAfrican-American and not African-American partition the samplespace:

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

=Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash|AfAm) Pr(AfAm) + Pr(Wash|not AfAm) Pr(not AfAm)

=0.132× 0.00378

0.132× 0.00378 + .868× 0.000061

≈ 0.9

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 57 / 60

Page 255: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Note we don’t have the probability of the name Washington.

Remember that we can calculate it from the LTP since the setsAfrican-American and not African-American partition the samplespace:

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

=Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash|AfAm) Pr(AfAm) + Pr(Wash|not AfAm) Pr(not AfAm)

=0.132× 0.00378

0.132× 0.00378 + .868× 0.000061

≈ 0.9

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 57 / 60

Page 256: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Note we don’t have the probability of the name Washington.

Remember that we can calculate it from the LTP since the setsAfrican-American and not African-American partition the samplespace:

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

=

Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash|AfAm) Pr(AfAm) + Pr(Wash|not AfAm) Pr(not AfAm)

=0.132× 0.00378

0.132× 0.00378 + .868× 0.000061

≈ 0.9

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 57 / 60

Page 257: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Note we don’t have the probability of the name Washington.

Remember that we can calculate it from the LTP since the setsAfrican-American and not African-American partition the samplespace:

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

=Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash|AfAm) Pr(AfAm) + Pr(Wash|not AfAm) Pr(not AfAm)

=0.132× 0.00378

0.132× 0.00378 + .868× 0.000061

≈ 0.9

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 57 / 60

Page 258: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Note we don’t have the probability of the name Washington.

Remember that we can calculate it from the LTP since the setsAfrican-American and not African-American partition the samplespace:

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

=Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash|AfAm) Pr(AfAm) + Pr(Wash|not AfAm) Pr(not AfAm)

=0.132× 0.00378

0.132× 0.00378 + .868× 0.000061

≈ 0.9

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 57 / 60

Page 259: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Example: Race and Names

Note we don’t have the probability of the name Washington.

Remember that we can calculate it from the LTP since the setsAfrican-American and not African-American partition the samplespace:

Pr(AfAm|Wash) =Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash)

=Pr(Wash|AfAm) Pr(AfAm)

Pr(Wash|AfAm) Pr(AfAm) + Pr(Wash|not AfAm) Pr(not AfAm)

=0.132× 0.00378

0.132× 0.00378 + .868× 0.000061

≈ 0.9

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 57 / 60

Page 260: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Independence

Intuitive DefinitionEvents A and B are independent if knowing whether A occurredprovides no information about whether B occurred.

Formal Definition

P(A,B) = P(A)P(B) =⇒ A⊥⊥B

With all the usual > 0 restrictions, this implies

P(A|B) = P(A)

P(B |A) = P(B)

Independence is a massively important concept in statistics.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 58 / 60

Page 261: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Independence

Intuitive Definition

Events A and B are independent if knowing whether A occurredprovides no information about whether B occurred.

Formal Definition

P(A,B) = P(A)P(B) =⇒ A⊥⊥B

With all the usual > 0 restrictions, this implies

P(A|B) = P(A)

P(B |A) = P(B)

Independence is a massively important concept in statistics.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 58 / 60

Page 262: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Independence

Intuitive DefinitionEvents A and B are independent if knowing whether A occurredprovides no information about whether B occurred.

Formal Definition

P(A,B) = P(A)P(B) =⇒ A⊥⊥B

With all the usual > 0 restrictions, this implies

P(A|B) = P(A)

P(B |A) = P(B)

Independence is a massively important concept in statistics.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 58 / 60

Page 263: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Independence

Intuitive DefinitionEvents A and B are independent if knowing whether A occurredprovides no information about whether B occurred.

Formal Definition

P(A,B) = P(A)P(B) =⇒ A⊥⊥B

With all the usual > 0 restrictions, this implies

P(A|B) = P(A)

P(B |A) = P(B)

Independence is a massively important concept in statistics.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 58 / 60

Page 264: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Independence

Intuitive DefinitionEvents A and B are independent if knowing whether A occurredprovides no information about whether B occurred.

Formal Definition

P(A,B) = P(A)P(B) =⇒ A⊥⊥B

With all the usual > 0 restrictions, this implies

P(A|B) = P(A)

P(B |A) = P(B)

Independence is a massively important concept in statistics.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 58 / 60

Page 265: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Independence

Intuitive DefinitionEvents A and B are independent if knowing whether A occurredprovides no information about whether B occurred.

Formal Definition

P(A,B) = P(A)P(B) =⇒ A⊥⊥B

With all the usual > 0 restrictions, this implies

P(A|B) = P(A)

P(B |A) = P(B)

Independence is a massively important concept in statistics.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 58 / 60

Page 266: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Independence

Intuitive DefinitionEvents A and B are independent if knowing whether A occurredprovides no information about whether B occurred.

Formal Definition

P(A,B) = P(A)P(B) =⇒ A⊥⊥B

With all the usual > 0 restrictions, this implies

P(A|B) = P(A)

P(B |A) = P(B)

Independence is a massively important concept in statistics.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 58 / 60

Page 267: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Independence

Intuitive DefinitionEvents A and B are independent if knowing whether A occurredprovides no information about whether B occurred.

Formal Definition

P(A,B) = P(A)P(B) =⇒ A⊥⊥B

With all the usual > 0 restrictions, this implies

P(A|B) = P(A)

P(B |A) = P(B)

Independence is a massively important concept in statistics.

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 58 / 60

Page 268: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Next Week

Homework do on Thursday (8 days)! Why have homeworkassigned on the first day?

Random Variables

Reading for Random VariablesI Blitzstein and Hwang Chapters 2, 3-3.2 (random variables),

4-4.2 (expectation), 4.4-4.6 (indicator rv, LOTUS, variance),5.1-5.4 (continuous random variables), 7.0-7.3 (jointdistributions)

I Optional: Imai Chapter 6 (probability), Aronow and MillerChapter 2

A word from your preceptors

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 59 / 60

Page 269: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Next Week

Homework do on Thursday (8 days)! Why have homeworkassigned on the first day?

Random Variables

Reading for Random VariablesI Blitzstein and Hwang Chapters 2, 3-3.2 (random variables),

4-4.2 (expectation), 4.4-4.6 (indicator rv, LOTUS, variance),5.1-5.4 (continuous random variables), 7.0-7.3 (jointdistributions)

I Optional: Imai Chapter 6 (probability), Aronow and MillerChapter 2

A word from your preceptors

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 59 / 60

Page 270: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Next Week

Homework do on Thursday (8 days)! Why have homeworkassigned on the first day?

Random Variables

Reading for Random Variables

I Blitzstein and Hwang Chapters 2, 3-3.2 (random variables),4-4.2 (expectation), 4.4-4.6 (indicator rv, LOTUS, variance),5.1-5.4 (continuous random variables), 7.0-7.3 (jointdistributions)

I Optional: Imai Chapter 6 (probability), Aronow and MillerChapter 2

A word from your preceptors

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 59 / 60

Page 271: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Next Week

Homework do on Thursday (8 days)! Why have homeworkassigned on the first day?

Random Variables

Reading for Random VariablesI Blitzstein and Hwang Chapters 2, 3-3.2 (random variables),

4-4.2 (expectation), 4.4-4.6 (indicator rv, LOTUS, variance),5.1-5.4 (continuous random variables), 7.0-7.3 (jointdistributions)

I Optional: Imai Chapter 6 (probability), Aronow and MillerChapter 2

A word from your preceptors

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 59 / 60

Page 272: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

Next Week

Homework do on Thursday (8 days)! Why have homeworkassigned on the first day?

Random Variables

Reading for Random VariablesI Blitzstein and Hwang Chapters 2, 3-3.2 (random variables),

4-4.2 (expectation), 4.4-4.6 (indicator rv, LOTUS, variance),5.1-5.4 (continuous random variables), 7.0-7.3 (jointdistributions)

I Optional: Imai Chapter 6 (probability), Aronow and MillerChapter 2

A word from your preceptors

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 59 / 60

Page 273: Soc500: Applied Social Statistics Week 1: Introduction and ...Soc500: Applied Social Statistics Week 1: Introduction and Probability Brandon Stewart1 Princeton September 12, 2018 1These

References

Enos, Ryan D. ”What the demolition of public housing teachesus about the impact of racial threat on political behavior.”American Journal of Political Science (2015).

J. Andrew Harris “Whats in a Name? A Method for ExtractingInformation about Ethnicity from Names” Political Analysis(2015).

Salsburg, David. The Lady Tasting Tea: How StatisticsRevolutionized Science in the Twentieth Century (2002).

Stewart (Princeton) Week 1: Introduction and Probability September 12, 2018 60 / 60