Top Banner
Stats for Engineers: Lecture 3
19

Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Mar 29, 2015

Download

Documents

Denise Dunmore
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Stats for Engineers: Lecture 3

Page 2: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

1 2 3 4 5

11%14%

3%

29%

44%

Conditional probability

Suppose there are three cards:

A red card that is red on both sides, A white card that is white on both sides, and A mixed card that is red on one side and white on the other.

All the cards are placed into a hat and one is pulled at random and placed on a table.

If the side facing up is red, what is the probability that the other side is also red?

1. 1/62. 1/33. 1/24. 2/35. 5/6

Page 3: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Conditional probability

Suppose there are three cards:

A red card that is red on both sides, A white card that is white on both sides, and A mixed card that is red on one side and white on the other.

All the cards are placed into a hat and one is pulled at random and placed on a table.

If the side facing up is red, what is the probability that the other side is also red?

Red card

White card

Mixed card

13

13

13

Top Red

Top White

Top White

Top Red

12

12

1

1 13

13

16

16

Let R=red card, TR = top red.

¿23

¿

13

13+ 1

6

Probability tree

Page 4: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Conditional probability

Suppose there are three cards:

A red card that is red on both sides, A white card that is white on both sides, and A mixed card that is red on one side and white on the other.

All the cards are placed into a hat and one is pulled at random and placed on a table.

If the side facing up is red, what is the probability that the other side is also red?

The probability we want is P(R|TR) since having the red card is the only way for the other side also to be red.

Let R=red card, W = white card, M = mixed card. Let TR = top is a red face.

For a random draw P(R)=P(W)=P(M)=1/3.

𝑃 (𝑇 𝑅 )=𝑃 (𝑇 𝑅|𝑅 )𝑃 (𝑅 )+𝑃 (𝑇 𝑅|𝑀 )𝑃 (𝑀 )¿1 ×

13+

12

×13

Total probability rule:

¿12

This is

¿1×

13

12

¿23

Intuition: 2/3 of the three red faces are on the red card.

Page 5: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Summary From Last Time

Permutations - ways of ordering k items: k!

Ways of choosing k things from n, irrespective of ordering:

𝐶𝑘𝑛=(𝑛𝑘)= 𝑛 !

𝑘! (𝑛−𝑘 )!

Random Variables: Discreet and Continuous

Mean 𝜇=𝐸 ( 𝑓 ( 𝑋 ) ) ≡ ⟨ 𝑓 ( 𝑋 ) ⟩=∑𝑘

𝑓 (𝑘 ) 𝑃 (𝑋=𝑘)

⟨𝑎𝑋+𝑏𝑌 ⟩=⟨𝑎𝑋 ⟩+ ⟨𝑏𝑌 ⟩=𝑎 ⟨ 𝑋 ⟩ +𝑏 ⟨𝑌 ⟩=𝑎𝜇𝑋+𝑏𝜇𝑌Means add:

Bayes’ Theorem 𝑃 ( 𝐴|𝐵 )= 𝑃 (𝐵|𝐴 )𝑃 ( 𝐴)𝑃 (𝐵 )

𝑃 ( 𝐴|𝐵 )= 𝑃 ( 𝐴∩𝐵 )𝑃 (𝐵 )e.g. from

Total Probability Rule: 𝑃 (𝐵 )=∑𝑘

𝑃 (𝐵|𝐴𝑘 )𝑃 ( 𝐴𝑘)

Page 6: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Mean of a product of independent random variables

If and are independent random variables, then

¿∑𝑥

𝑃 (𝑥 )𝑥∑𝑦

𝑃 (𝑦 ) 𝑦

¿ ⟨ 𝑋 ⟩ ⟨𝑌 ⟩=𝜇𝑋 𝜇𝑌

⟨ 𝑋𝑌 ⟩=∑𝑥∑𝑦

𝑃 (𝑥∩ 𝑦 )𝑥𝑦=¿∑𝑥∑𝑦

𝑃 (𝑥 )𝑃 (𝑦 ) 𝑥𝑦 ¿

Note: in general this is not true if the variables are not independent

Example: If I throw two dice, what is the mean value of the product of the throws?

Two throws are independent, so

The mean of one throw is

¿ (1+2+3+4+5+6 )× 16=

216

=3.5

¿1 ×16+2 ×

16+3 ×

16+4 ×

16+5 ×

16+6 ×

16

Page 7: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Variance and standard deviation of a distribution For a random variable X taking values 0, 1, 2 the mean is a measure of the average value of a distribution, . The standard deviation, , is a measure of how spread out the distribution is

𝑃 (𝑋=𝑘)

𝜇𝜎𝜎

𝑘

Page 8: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Definition of the variance (=

So the variance can also be written

⟨ ( 𝑋 −𝜇 )2 ⟩= ⟨𝑋 2− 2𝑋 𝜇+𝜇2 ⟩Note that

¿ ⟨𝑋 2 ⟩ − 2 ⟨ 𝑋 𝜇 ⟩+𝜇2

¿ ⟨𝑋 2 ⟩ − 2𝜇2+𝜇2

¿ ⟨𝑋 2 ⟩ −𝜇2

𝜇 ⟨ 𝑋 ⟩=𝜇2

𝜎 2≡ var (𝑋 )=⟨ ( 𝑋−𝜇)2 ⟩=¿

This equivalent form is often easier to evaluate in practice, though can be less numerically stable (e.g. when subtracting two large numbers).

Page 9: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Example: what is the mean and standard deviation of the result of a dice throw?

Answer: Let be the random variable that is the number on the dice

The mean is as shown previously.

The variance is = (=

Hence the standard deviation is 𝜇=3.5

𝜎𝜎

Page 10: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Sums of variances For two independent (or just uncorrelated) random variables X and Y the variance of X+Y is given by the sums of the separate variances.

Hence

Why? If has , and has , then

Hence since , if then

var

¿⟨ (𝑋 −𝜇𝑋 )2+(𝑌 −𝜇𝑌 )2+2 (𝑋−𝜇𝑋 ) (𝑌 −𝜇 𝑦 )⟩

If X and Y are independent (or just uncorrelated) then

= [“Variances add”]

Page 11: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

var (𝑋+𝑌+𝑍+… )=var (𝑋 )+var (𝑌 )+var (𝑍 )+…

In general, for both discrete and continuous independent (or uncorrelated) random variables

Example:

The mean weight of people in England is μ=72.4kg, with standard deviation 15kg.

What is the mean and standard deviation of the weight of the passengers on a plane carrying 200 people?

Answer:

The total weight

Since means add

Assuming weights independent, variances also add, w

𝜎=√45000 Kg2 ≈ 212 Kg𝜎𝑀2 =∑

𝑖=1

200

225 Kg2=200 ×225 Kg2=45000 Kg2

In reality be careful - assumption of independence unlikely to be accurate

Page 12: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

1 2 3 4 5

13%

7% 6%

43%

30%

Error bars

A bridge uses 100 concrete slabs, each weighing tonnes [i.e. the standard deviation of each is 0.1 tonnes]

What is the total weight in tonnes of the concrete slabs?

1.

Page 13: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Error bars

A bridge uses 100 concrete slabs, each weighing tonnes [i.e. the standard deviation of each is 0.1 tonnes]

What is the total weight in tonnes of the concrete slabs?

Means add, so

Note: Error grows with the square root of the number:

fractional error decreases

But the mean of the total is

Hence

Variances add, with , so

Page 14: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Binomial distribution A process with two possible outcomes, "success" and "failure" (or yes/no, etc.) is called a Bernoulli trial.

Discrete Random Variables

e.g. coin tossing: Heads or Tails

quality control: Satisfactory or Unsatisfactory

An experiment consists of n independent Bernoulli trials and p = probability of success for each trial. Let X = total number of successes in the n trials.

Then for k = 0, 1, 2, ... , n.

This is called the Binomial distribution with parameters n and p, or B(n, p) for short. X ~ B(n, p) stands for "X has the Binomial distribution with parameters n and p."

(𝑛𝑘)𝑝𝑘 (1 −𝑝 )𝑛−𝑘

Reminder:

(𝑛𝑘)

Polling: Agree or disagree

Page 15: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Situations where a Binomial might occur 1) Quality control: select n items at random; X = number found to be satisfactory. 2) Survey of n people about products A and B; X = number preferring A. 3) Telecommunications: n messages; X = number with an invalid address. 4) Number of items with some property above a threshold; e.g. X = number with height > A

Page 16: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

"X = k" means k successes (each with probability p) and n-k failures (each with probability 1-p).

Justification

Suppose for the moment all the successes come first. Assuming independence

probability =

=

successes: failures:

Every possible different ordering also has this same probability. The total number of ways of choosing k out of the n trails to be successes is , so there are , possible orderings.

Since each ordering is an exclusive possibility, by the special addition rule the overall probability is added times:

Page 17: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

𝑝=0.5

𝑃(𝑋=𝑘)

Page 18: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Example: If I toss a coin 100 times, what is the probability of getting exactly 50 tails?

Answer:

Let X = number tails in 100 tosses

Bernoulli trial: tail or head,

𝑃 (𝑋=50 )=𝐶𝑘𝑛𝑝𝑘(1−𝑝)𝑛−𝑘=𝐶50

100 0.550 (1− 0.5 )50

≈ 0.0796

Page 19: Stats for Engineers: Lecture 3. Conditional probability Suppose there are three cards: A red card that is red on both sides, A white card that is white.

Example: A component has a 20% chance of being a dud. If five are selected from a large batch, what is the probability that more than one is a dud?

P(More than one dud) =

Bernoulli trial: dud or not dud,

=

Answer:

Let X = number of duds in selection of 5

=

= 1 - 0.32768 - 0.4096 0.263.