Top Banner
HUDM4122 Probability and Statistical Inference February 16, 2015
110

HUDM4122 Probability and Statistical Inference February 16, 2015.

Dec 17, 2015

Download

Documents

Lizbeth Harris
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: HUDM4122 Probability and Statistical Inference February 16, 2015.

HUDM4122Probability and Statistical Inference

February 16, 2015

Page 2: HUDM4122 Probability and Statistical Inference February 16, 2015.

In the last class

• We started Ch. 4.4 in Mendenhall, Beaver, & Beaver

Page 3: HUDM4122 Probability and Statistical Inference February 16, 2015.

Today

• Ch. 4.4-4.6 in Mendenhall, Beaver, & Beaver

Page 4: HUDM4122 Probability and Statistical Inference February 16, 2015.

Today

• Sampling without Replacement• Permutations• Combinations• Independence• Conditional Probability

Page 5: HUDM4122 Probability and Statistical Inference February 16, 2015.

We ended last class with this problem

• I call a radio station, where they make me pick a number between 1 and 10

• If I get today’s winning number, I get free tickets to hear Justin Bieber (oh lucky me)

• Let’s say I call 5 days in a row

• What is the probability I get tickets on exactly one day? (my daughter will still be excited)

Page 6: HUDM4122 Probability and Statistical Inference February 16, 2015.

Solution A (professor)

= 0.40951

Page 7: HUDM4122 Probability and Statistical Inference February 16, 2015.

Solution B (class)

= 0.32805

Page 8: HUDM4122 Probability and Statistical Inference February 16, 2015.

These represent two different problems

• One (my solution) is sampling without replacement

• The other is the sum of five independent events

• So which one is right?

Page 9: HUDM4122 Probability and Statistical Inference February 16, 2015.

Let’s first compute the sample space

• I call a radio station, where they make me pick a number between 1 and 10

• I call 5 days in a row

• 10*10*10*10*10 = 100,000

Page 10: HUDM4122 Probability and Statistical Inference February 16, 2015.

Let’s first compute the sample space

• I call a radio station, where they make me pick a number between 1 and 10

• I call 5 days in a row

• 10*10*10*10*10 = 100,000• A.K.A. too many to list out

Page 11: HUDM4122 Probability and Statistical Inference February 16, 2015.

Let’s first compute the sample space

• I call a radio station, where they make me pick a number between 1 and 10

• I call 5 days in a row

• 10*10*10*10*10 = 100,000• A.K.A. too many to list out– Or is it?

Page 12: HUDM4122 Probability and Statistical Inference February 16, 2015.

bieber-tix-example.xlsx

Page 13: HUDM4122 Probability and Statistical Inference February 16, 2015.

bieber-tix-example.xlsx

• 0.32805

Page 14: HUDM4122 Probability and Statistical Inference February 16, 2015.

So, the professor was wrong

Page 15: HUDM4122 Probability and Statistical Inference February 16, 2015.

So, the professor was wrong

• It turns out sleep deprivation is bad for cognition

Page 16: HUDM4122 Probability and Statistical Inference February 16, 2015.

So, the professor was wrong

• It turns out sleep deprivation is bad for cognition

• Don’t try this on your midterm

Page 17: HUDM4122 Probability and Statistical Inference February 16, 2015.

So why was this correct?

Page 18: HUDM4122 Probability and Statistical Inference February 16, 2015.

5 days to win 1 ticket

• CXXXX• XCXXX• XXCXX• XXXCX• XXXXC

5 days

Page 19: HUDM4122 Probability and Statistical Inference February 16, 2015.

In each of 5 days,1 answer where right

5 daysCorrect answer

Page 20: HUDM4122 Probability and Statistical Inference February 16, 2015.

And 4 more days where wrong; and there are 9 wrong answers

5 casesCorrect answer

Wrong answers

Wrong days

Page 21: HUDM4122 Probability and Statistical Inference February 16, 2015.

Each day is independent from other days – that’s why this is the correct math

5 casesCorrect answer

Wrong answers

Wrong days

Page 22: HUDM4122 Probability and Statistical Inference February 16, 2015.

Questions? Comments?

Page 23: HUDM4122 Probability and Statistical Inference February 16, 2015.

The sample space for multi-stage data collection can be calculated using

• The Extended mn rule

Page 24: HUDM4122 Probability and Statistical Inference February 16, 2015.

The Extended mn rule

• Let’s say you have k stages of your data collection

Page 25: HUDM4122 Probability and Statistical Inference February 16, 2015.

The Extended mn rule

• Let’s say you have k stages of data collection– Unlike the book, I don’t call it an experiment,

because that term is usually given a more specific meaning by researchers

Page 26: HUDM4122 Probability and Statistical Inference February 16, 2015.

The Extended mn rule

• Let’s say you have k stages of data collection• And there are n1 ways to accomplish the first

stage• And n2 ways to accomplish the 2nd stage

• And n3 ways to accomplish the 3rd stage

• And nk ways to accomplish the kth stage

• Then the sample space = n1 * n2 * n3 … * nk

Page 27: HUDM4122 Probability and Statistical Inference February 16, 2015.

Any questions about theExtended mn rule?

• Let’s say you have k stages of data collection• And there are n1 ways to accomplish the first

stage• And n2 ways to accomplish the 2nd stage

• And n3 ways to accomplish the 3rd stage

• And nk ways to accomplish the kth stage

• Then the sample space = n1 * n2 * n3 … * nk

Page 28: HUDM4122 Probability and Statistical Inference February 16, 2015.

Note that there doesn’t have to be the same probability in each stage!

Page 29: HUDM4122 Probability and Statistical Inference February 16, 2015.

This can come in useful in cases that are not truly independent

• Unlike the Justin Bieber example

Page 30: HUDM4122 Probability and Statistical Inference February 16, 2015.

Independence

• Two events A and B are independent if A does not affect B and B does not affect A

Page 31: HUDM4122 Probability and Statistical Inference February 16, 2015.

Which of these are independent?A B

Flipping a fair coin Flipping same fair coin again

Page 32: HUDM4122 Probability and Statistical Inference February 16, 2015.

Which of these are independent?A B

Flipping a fair coin Flipping same fair coin againFlipping a biased coin Flipping same biased coin again

Page 33: HUDM4122 Probability and Statistical Inference February 16, 2015.

Which of these are independent?A B

Flipping a fair coin Flipping same fair coin againFlipping a biased coin Flipping same biased coin again

Bob parties late night before midterm

Bob falls asleep during midterm

Page 34: HUDM4122 Probability and Statistical Inference February 16, 2015.

Which of these are independent?A B

Flipping a fair coin Flipping same fair coin againFlipping a biased coin Flipping same biased coin again

Bob parties late night before midterm

Bob falls asleep during midterm

Bob’s grade on midterm Bob’s grade on final

Page 35: HUDM4122 Probability and Statistical Inference February 16, 2015.

Which of these are independent?A B

Flipping a fair coin Flipping same fair coin againFlipping a biased coin Flipping same biased coin again

Bob parties late night before midterm

Bob falls asleep during midterm

Bob’s grade on midterm Bob’s grade on finalBob disrupts class Bob gets expelled

Page 36: HUDM4122 Probability and Statistical Inference February 16, 2015.

Which of these are independent?A B

Flipping a fair coin Flipping same fair coin againFlipping a biased coin Flipping same biased coin again

Bob parties late night before midterm

Bob falls asleep during midterm

Bob’s grade on midterm Bob’s grade on finalBob disrupts class Bob gets expelledBob gets expelled Bob takes job at McDonald’s

Page 37: HUDM4122 Probability and Statistical Inference February 16, 2015.

Which of these are independent?A B

Flipping a fair coin Flipping same fair coin againFlipping a biased coin Flipping same biased coin again

Bob parties late night before midterm

Bob falls asleep during midterm

Bob’s grade on midterm Bob’s grade on finalBob disrupts class Bob gets expelledBob gets expelled Bob takes job at McDonald’s

Bob takes job at McDonald’s Bob wins lottery

Page 38: HUDM4122 Probability and Statistical Inference February 16, 2015.

Which of these are independent?A B

Flipping a fair coin Flipping same fair coin againFlipping a biased coin Flipping same biased coin again

Bob parties late night before midterm

Bob falls asleep during midterm

Bob’s grade on midterm Bob’s grade on finalBob disrupts class Bob gets expelledBob gets expelled Bob takes job at McDonald’s

Bob takes job at McDonald’s Bob wins lotteryBob wins lottery Bob becomes ill due to

congenital heart problem

Page 39: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example of probability calculation with non-independent events

• Let’s say that I invite 6 friends over to play Beer Hunter

Page 40: HUDM4122 Probability and Statistical Inference February 16, 2015.

This can come in useful in cases that are not truly independent

• Let’s say that 6 friends decide to play Beer Hunter

• Rules are– 6 cans of beer– 1 violently shaken before playing and then

shuffled– Each person chooses a can and opens it

Page 41: HUDM4122 Probability and Statistical Inference February 16, 2015.

Initial Sample Space = 6

• 1 Bad outcome• 5 Perfectly fine outcomes– If you don’t like beer, imagine it’s root beer

Page 42: HUDM4122 Probability and Statistical Inference February 16, 2015.

Probability of bad outcome

• The probability of a bad outcome for friend 1 is 1/6

Page 43: HUDM4122 Probability and Statistical Inference February 16, 2015.

Probability of bad outcome

• The probability of a bad outcome for friend 1 is 1/6

• But if friend 1 comes out OK• The probability of a bad outcome for friend 2

is 1/5, not 1/6!

Page 44: HUDM4122 Probability and Statistical Inference February 16, 2015.

Probability of bad outcome

• The probability of a bad outcome for friend 1 is 1/6

• But if friend 1 comes out OK• The probability of a bad outcome for friend 2

is 1/5, not 1/6!

• Does everyone see why?

Page 45: HUDM4122 Probability and Statistical Inference February 16, 2015.

Probability of bad outcome

• The probability of a bad outcome for friend 1 is 1/6

• But if friend 1 comes out OK• The probability of a bad outcome for friend 2 is

1/5, not 1/6!

• Does everyone see why?– Friend 1 already opened a beer can, and it went ok– This only leaves 5 closed beer cans

Page 46: HUDM4122 Probability and Statistical Inference February 16, 2015.

Probability of bad outcome

• If friend 2 comes out OK• The probability of a bad outcome for friend 3

is 1/4

Page 47: HUDM4122 Probability and Statistical Inference February 16, 2015.

Probability of bad outcome

• If friend 2 comes out OK• The probability of a bad outcome for friend 3

is 1/4

• If friend 3 comes out OK• The probability of a bad outcome for friend 4

is 1/3

Page 48: HUDM4122 Probability and Statistical Inference February 16, 2015.

Probability of bad outcome

• If friend 4 comes out OK• The probability of a bad outcome for friend 5

is 1/2

Page 49: HUDM4122 Probability and Statistical Inference February 16, 2015.

Probability of bad outcome

• If friend 4 comes out OK• The probability of a bad outcome for friend 5

is 1/2

• If friend 5 comes out OK

Page 50: HUDM4122 Probability and Statistical Inference February 16, 2015.

Probability of bad outcome

• If friend 4 comes out OK• The probability of a bad outcome for friend 5

is 1/2

• If friend 5 comes out OK• Friend 6 will need to get a clean shirt

Page 51: HUDM4122 Probability and Statistical Inference February 16, 2015.

True sample space

• 6 *5 * 4 * 3 * 2 * 1

• This is called sampling without replacement

Page 52: HUDM4122 Probability and Statistical Inference February 16, 2015.

Any questions?

Page 53: HUDM4122 Probability and Statistical Inference February 16, 2015.

Now you do an example

• In pairs

Page 54: HUDM4122 Probability and Statistical Inference February 16, 2015.

Now you do an example

• Let’s say I’m a roadie for the band Van Halen

Page 55: HUDM4122 Probability and Statistical Inference February 16, 2015.

Now you do an example

• Let’s say I’m a roadie for the band Van Halen– Professors need to moonlight to make ends meet

in this city

Page 56: HUDM4122 Probability and Statistical Inference February 16, 2015.

Now you do an example

• Let’s say I’m a roadie for the band Van Halen• They have a “no brown M&M’s” clause in their

contract, and if any of the four members get a brown M&M, I’m fired

Page 57: HUDM4122 Probability and Statistical Inference February 16, 2015.

Now you do an example

• Let’s say I’m a roadie for the band Van Halen• They have a “no brown M&M’s” clause in their

contract, and if any of the four members get a brown M&M, I’m fired

• I’ve just handed them a bowl with 20 M&Ms, including one brown M&M

• Each band member takes 1 M&M without looking

Page 58: HUDM4122 Probability and Statistical Inference February 16, 2015.

Now you do an example

• Let’s say I’m a roadie for the band Van Halen• They have a “no brown M&M’s” clause in their

contract, and if any of the four members get a brown M&M, I’m fired

• I’ve just handed them a bowl with 20 M&Ms, including one brown M&M

• Each band member takes 1 M&M without looking

• What is the sample space?• What is the probability I get fired?

Page 59: HUDM4122 Probability and Statistical Inference February 16, 2015.

Questions? Comments?

Page 60: HUDM4122 Probability and Statistical Inference February 16, 2015.

Another example: permutations

• An application of sampling without replacement

• How many orderings can you have between a certain number of objects?

Page 61: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• Let’s say that I’m redecorating my office in preparation for a visit from a funder from the US army (“Bob”), a funder from the National Science Foundation (“Janet”), and a funder from the US Department of Education (“Ed”)

• I want to place Bob’s book, Janet’s book, and Ed’s book in a place of honor next to my desk

• How many different orders can I put their books in?

Page 62: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• The first book could be Bob’s, Janet’s, or Ed’s• If the first book is Bob’s, the second book can

only be Janet’s or Ed’s• If the first book is Bob’s, and the second book

is Janet’s, then the third book can only be Ed’s

• We’re back to the same math of 3*2*1

Page 63: HUDM4122 Probability and Statistical Inference February 16, 2015.

Any questions?

Page 64: HUDM4122 Probability and Statistical Inference February 16, 2015.

Formal equations

• The sample space for n stages, sampling without replacement, is

• n * (n-1) * (n-2) * (n-3) * (n-4) until (n-k)=1

• This is written n!– Pronounced “n factorial”

Page 65: HUDM4122 Probability and Statistical Inference February 16, 2015.

Formal equations

• The number of permutations for n objects, taking all of them together, is

• Still n!

• n * (n-1) * (n-2) * (n-3) * (n-4) until (n-k)=1

Page 66: HUDM4122 Probability and Statistical Inference February 16, 2015.

Do it yourself

• What is the number of permutations for 4 objects?

• What is the number of permutations for 6 objects?

• What is the number of permutations for 10 objects?

Page 67: HUDM4122 Probability and Statistical Inference February 16, 2015.

Do it yourself

• What is the number of permutations for 4 objects?– 4*3*2*1=24

• What is the number of permutations for 6 objects?– 6*5*4*3*2*1=720

• What is the number of permutations for 10 objects?– 10*9*8*7*6*5*4*3*2*1= 3,628,800

Page 68: HUDM4122 Probability and Statistical Inference February 16, 2015.

Ryan’s daughter suggests

• “Daddy, why don’t we try organizing all the books on your bookshelves in every possible way?”

• I own approximately 600 books

• Is this a good idea?

Page 69: HUDM4122 Probability and Statistical Inference February 16, 2015.

Questions? Comments?

Page 70: HUDM4122 Probability and Statistical Inference February 16, 2015.

More general case

• If we only want to pick r objects out of the n total objects, the equation becomes

Page 71: HUDM4122 Probability and Statistical Inference February 16, 2015.

Using general case equation

• If I want to find out how many orderings of 2 books I can get from 6 total books– n!/(n-r)! – 6!/(6-2)!– 6!/4! – 720/24– 30 possible orderings

Page 72: HUDM4122 Probability and Statistical Inference February 16, 2015.

Using general case equation

• If I want to find out how many orderings of 4 books I can get from 6 total books– n!/(n-r)! – 6!/(6-4)!– 6!/2! – 720/2– 360 possible orderings

Page 73: HUDM4122 Probability and Statistical Inference February 16, 2015.

Any questions?

Page 74: HUDM4122 Probability and Statistical Inference February 16, 2015.

Related problem: Combinations

• If we don’t care about order, but only want to know how many combinations of items we can get

Page 75: HUDM4122 Probability and Statistical Inference February 16, 2015.

Combination formula

• The number of combinations of r objects out of n total objects is

Page 76: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• I have five friends, and three tickets to see Ferrari Truck

• How many combinations of friends could I potentially bring?

Page 77: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• I have five friends, and three tickets to see Ferrari Truck

• How many combinations of friends could I potentially bring?

• = = = = = 10

Page 78: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• I have 600 books, and want to take 3 books on a ridiculously long flight to the First Uzbekistani Conference on Educational Data Mining

• How many combinations of books could I potentially bring?

Page 79: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• I have 600 books, and want to take 3 books on a ridiculously long flight to the First Uzbekistani Conference on Educational Data Mining

• How many combinations of books could I potentially bring?

• = = = =

• 35.8 million

Page 80: HUDM4122 Probability and Statistical Inference February 16, 2015.

Your turn

• Peter’s Pizzeria has 6 toppings, and a 2-topping special

• How many combinations of toppings could you get, and have the special?

Page 81: HUDM4122 Probability and Statistical Inference February 16, 2015.

Questions?

Page 82: HUDM4122 Probability and Statistical Inference February 16, 2015.

An application

• In my son’s play group, he has 6 playmates– 5 friends and 1 frenemy

• What is the probability that on a specific playdate with 2 friends, it will involve the frenemy?

Page 83: HUDM4122 Probability and Statistical Inference February 16, 2015.

Can be written as

• Number of playdates that involve frenemy = 5– Frenemy plus each of 5 friends

• Total number of combinations (2 of 6)

Page 84: HUDM4122 Probability and Statistical Inference February 16, 2015.

Can be written as

• Number of playdates that involve frenemy = 5– Frenemy plus each of 5 friends

• Total number of combinations (2 of 6) = 15

• 5/15 = 1/3

Page 85: HUDM4122 Probability and Statistical Inference February 16, 2015.

Any questions?

Page 86: HUDM4122 Probability and Statistical Inference February 16, 2015.

Conditional Probability

• Let’s take two non-independent events, A and B

• P(A | B) =• Probability of A,• Given that we know that B occurred

Page 87: HUDM4122 Probability and Statistical Inference February 16, 2015.

Conditional Probability

• Let’s take two non-independent events, A and B

• P(A | B) =• Probability of A,• Given that we know that B occurred

• Note that this tells us nothing about P(A | ~B) P(A) overall

Page 88: HUDM4122 Probability and Statistical Inference February 16, 2015.

General Multiplication Rule

• Probability of A and B equals • Probability of A• Multiplied by • Probability of B, given A

Page 89: HUDM4122 Probability and Statistical Inference February 16, 2015.

General Multiplication Rule

• Probability of A and B equals • Probability of A• Multiplied by • Probability of B, given A

• Formally

Page 90: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• P(Q) = 0.2• P(R|Q) = 0.7

• P(Q R) = ?

Page 91: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• P(Q) = 0.2• P(R|Q) = 0.7

• P(Q R) = 0.14

Page 92: HUDM4122 Probability and Statistical Inference February 16, 2015.

You try it

• P(F) = 0.7• P(G|F) = 0.5

• P(F G) = ?

Page 93: HUDM4122 Probability and Statistical Inference February 16, 2015.

You try it

• P(F) = 0.7• P(G|F) = 0.5

• P(F G) = 0.35

Page 94: HUDM4122 Probability and Statistical Inference February 16, 2015.

You try it

• P(X) = 0.3• P(X|Y) = 0.9

• P(X Y) = ?

Page 95: HUDM4122 Probability and Statistical Inference February 16, 2015.

You try it

• P(X) = 0.3• P(X|Y) = 0.9

• P(X Y) = ?

• Impossible to calculate from information given

Page 96: HUDM4122 Probability and Statistical Inference February 16, 2015.

A Concrete Example

• P(Bob parties late, night before exam) = 0.5• P(Bob does badly on exam | parties late) = 0.7• P(Bob does badly on exam | ~parties late) = 0.2

• What is the probability that Bob parties late and does badly?

Page 97: HUDM4122 Probability and Statistical Inference February 16, 2015.

A Concrete Example

• P(Bob parties late, night before exam) = 0.5• P(Bob does badly on exam | parties late) = 0.7• P(Bob does badly on exam | ~parties late) = 0.2

• What is the probability that Bob parties late and does badly?

• 0.35

Page 98: HUDM4122 Probability and Statistical Inference February 16, 2015.

Conditional Probability Formula

• A mathematical transformation of the General Multiplication Rule

• That rule was

Page 99: HUDM4122 Probability and Statistical Inference February 16, 2015.

Conditional Probability Formula

• If you divide both sides by P(A)

• Which resolves to

Page 100: HUDM4122 Probability and Statistical Inference February 16, 2015.

Conditional Probability Formula

• P(B|A) =

Page 101: HUDM4122 Probability and Statistical Inference February 16, 2015.

Conditional Probability Formula

• P(B|A) =

• Note that P(A) can’t equal 0, or you’re dividing by 0…

Page 102: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• P(B|A) =

• P(Bob parties late, night before exam) = 0.5• P(Bob does badly on exam AND parties late) =

0.35

• What is the probability that does badly, given that he parties late?

Page 103: HUDM4122 Probability and Statistical Inference February 16, 2015.

Example

• P(B|A) =

• P(Bob parties late, night before exam) = 0.5• P(Bob does badly on exam AND parties late) =

0.35

• What is the probability that does badly, given that he parties late? – 0.7

Page 104: HUDM4122 Probability and Statistical Inference February 16, 2015.

You try it

• P(B|A) =

• P(Student is from a wealthy family) = 0.25• P(Student goes to college AND comes from

wealthy family) = 0.21

• What is the probability that a student goes to college, given that he/she comes from a wealthy family?

Page 105: HUDM4122 Probability and Statistical Inference February 16, 2015.

You try it

• P(B|A) =

• P(Student is from a wealthy family) = 0.25• P(Student goes to college AND comes from wealthy

family) = 0.21

• What is the probability that a student goes to college, given that he/she comes from a wealthy family?– 0.84

Page 106: HUDM4122 Probability and Statistical Inference February 16, 2015.

What if we want to determine

• P(B|A) =

• P(Student is from a wealthy family) = 0.25• P(Student goes to college AND comes from

wealthy family) = 0.21

• What is the probability that a student comes from a wealthy family, given that he/she went to college?

Page 107: HUDM4122 Probability and Statistical Inference February 16, 2015.

What if we want to determine

• P(B|A) =

• P(Student is from a wealthy family) = 0.25• P(Student goes to college AND comes from wealthy

family) = 0.21

• What is the probability that a student comes from a wealthy family, given that he/she went to college?– We can’t tell, using this formula…

Page 108: HUDM4122 Probability and Statistical Inference February 16, 2015.

Upcoming Classes

• 2/18 Bayes Theorem– Ch. 4-7– HW3 due

• 2/23 Discrete Random Variables and Their Probability Distributions– Ch. 4-8

Page 109: HUDM4122 Probability and Statistical Inference February 16, 2015.

Homework 3

• Due in 2 days• In the ASSISTments system

Page 110: HUDM4122 Probability and Statistical Inference February 16, 2015.

Questions? Comments?