Random Variables & E xpectation. Random Variable A random variable (r.v.) is a well defined rule for assigning a numerical value to all possible outcomes.

Random Variables & Expectation

Random VariableA random variable (r.v.) is a well defined rule for

assigning a numerical value to all possible outcomes of an experiment.

example:

experiment: taking a courseoutcomes: grades A, B, C, D, Fsample space S: discrete & finiterandom variable: Y = 4 if grade is A

Y = 3 if grade is BY = 2 if grade is CY = 1 if grade is DY = 0 if grade is F

Experiment: throw 2 diceWhat are the possible outcomes?

1,1 2,1 3,1 4,1 5,1 6,1

1,2 2,2 3,2 4,2 5,2 6,2

1,3 2,3 3,3 4,3 5,3 6,3

1,4 2,4 3,4 4,4 5,4 6,4

1,5 2,5 3,5 4,5 5,5 6,5

1,6 2,6 3,6 4,6 5,6 6,6

Define the random variable X to be the sum of the dots on the 2 dice.

For which outcomes does X = 9

1,1 2,1 3,1 4,1 5,1 6,1

1,2 2,2 3,2 4,2 5,2 6,2

1,3 2,3 3,3 4,3 5,3 6,3

1,4 2,4 3,4 4,4 5,4 6,4

1,5 2,5 3,5 4,5 5,5 6,5

1,6 2,6 3,6 4,6 5,6 6,6

For which outcomes does X = 9

1,1 2,1 3,1 4,1 5,1 6,1

1,2 2,2 3,2 4,2 5,2 6,2

1,3 2,3 3,3 4,3 5,3 6,3

1,4 2,4 3,4 4,4 5,4 6,4

1,5 2,5 3,5 4,5 5,5 6,5

1,6 2,6 3,6 4,6 5,6 6,6

What is Pr(X=9)?

1,1 2,1 3,1 4,1 5,1 6,1

1,2 2,2 3,2 4,2 5,2 6,2

1,3 2,3 3,3 4,3 5,3 6,3

1,4 2,4 3,4 4,4 5,4 6,4

1,5 2,5 3,5 4,5 5,5 6,5

1,6 2,6 3,6 4,6 5,6 6,6

Since there are 36 equally likely outcomes, each has a probability of 1/36.

So since there are 4 outcomes that yield X=9, Pr(X=9) = 4/36 =1/9

Let’s calculate the probabilities of all the possible values x of the random variable X

x Pr(X=x)1,1 2,1 3,1 4,1 5,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

Let’s calculate the probabilities of the possible values x of the random variable X

x Pr(X=x) 2 1/361,1 2,1 3,1 4,1 5,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36 4 3/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36 4 3/36 5 4/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36 4 3/36 5 4/36 6 5/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36 4 3/36 5 4/36 6 5/36 7 6/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36 4 3/36 5 4/36 6 5/36 7 6/36 8 5/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36 4 3/36 5 4/36 6 5/36 7 6/36 8 5/36 9 4/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36 4 3/36 5 4/36 6 5/36 7 6/36 8 5/36 9 4/3610 3/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36 4 3/36 5 4/36 6 5/36 7 6/36 8 5/36 9 4/3610 3/3611 2/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

x Pr(X=x) 2 1/36 3 2/36 4 3/36 5 4/36 6 5/36 7 6/36 8 5/36 9 4/3610 3/3611 2/3612 1/36

1,1 2,1 3,1 4,1 5,16,1

1,2 2,2 3,2 4,2 5,26,2

1,3 2,3 3,3 4,3 5,36,3

1,4 2,4 3,4 4,4 5,46,4

1,5 2,5 3,5 4,5 5,56,5

1,6 2,6 3,6 4,6 5,66,6

Let’s graph the probability distribution of X.

x Pr(X=x)

2 1/36 3 2/36 4 3/36 5 4/36 6 5/36 7 6/36 8 5/36 9 4/3610 3/3611 2/3612 1/36

Pr(X=x)

2 3 4 5 6 7 8 9 10 11 12 x

Pr(X=x) = f(x) = p(x)as described in this table or graph is called the

probability distribution or probability mass function (p.m.f.)

x Pr(X=x)

2 1/36 3 2/36 4 3/36 5 4/36 6 5/36 7 6/36 8 5/36 9 4/3610 3/3611 2/3612 1/36

Pr(X=x)

2 3 4 5 6 7 8 9 10 11 12 x

Properties of Probability Distributions

1. 0 ≤ Pr(X=x) ≤ 1 for all x

2. 1)( x

Cumulative Mass Function

)()Pr()(00

xpxXxF

Cumulative Mass Function (2 dice problem)

x Pr(X=x) Pr(X≤x) 2 1/36 1/36 3 2/36 3/36 4 3/36 6/36 5 4/36 10/36 6 5/36 15/36 7 6/36 21/36 8 5/36 26/36 9 4/36 30/3610 3/36 33/3611 2/36 35/3612 1/36 1 0 1 2 3 4 5 6 7 8 9 10 11 12 13

Expectation, Expected Value, or Mean of a Random Variable

xxpXE )()(

Notice the similarity of the definitions of the mean of a random variable & the mean of

a frequency distribution for a population

fxfxN i

)/1( :distrib. freq. pop.

xxpXE )()(

Recall that probability [p(x)] is the relative frequency [f/N] with which something occurs over the long run.

So these definitions are saying the same thing.

Example: Suppose that a stock broker wants to estimate the price of a certain stock one year from now. If the probability mass function of the price in a year is as given, determine the expected price.

x = price in one year p(x)

94 0.25

98 0.25

102 0.25

106 0.25

x = price in one year p(x)

94 0.25

98 0.25

102 0.25

106 0.25

x = price in one year p(x) xp(x)

94 0.25 23.5

98 0.25 24.5

102 0.25 25.5

106 0.25 26.5

x = price in one year p(x) xp(x)

94 0.25 23.5

98 0.25 24.5

102 0.25 25.5

106 0.25 26.5

1.00 100.0

Notice that you do NOT divide by the number of observations when you’re done adding.

Also, the probabilities do not have to be equal; they just have to add up to one.

Theorem: Suppose that g(X) is a function of a random variable X, & the probability mass function of

X is px(x). Then the expected value of g(X) is

xxpxgXgE )()()]([

Example: Suppose Y = X2 & the distribution of X is as given below. Determine the mean of g(X) by using1. the definition of expected value, & 2. the previous theorem.

x p(x)

-2 0.1

-1 0.2

x p(x) y p(y)

-2 0.1

-1 0.2

x p(x) y p(y)

-2 0.1 1 0.5

-1 0.2

x p(x) y p(y)

-2 0.1 1 0.5

-1 0.2 4 0.5

x p(x) y p(y) yp(y)

-2 0.1 1 0.5 0.5

-1 0.2 4 0.5 2.0

x p(x) y p(y) yp(y)

-2 0.1 1 0.5 0.5

-1 0.2 4 0.5 2.0

1 0.3 E(Y) = 2.5

x p(x) y

-2 0.1 4

-1 0.2 1

1 0.3 1

2 0.4 4

x p(x) y ypx(x)

-2 0.1 4 0.4

-1 0.2 1 0.2

1 0.3 1 0.3

2 0.4 4 1.6

x p(x) y ypx(x)

-2 0.1 4 0.4

-1 0.2 1 0.2

1 0.3 1 0.3

2 0.4 4 1.6

E(Y) = 2.5

Definition:Variance of a random variable X

])[()(

Theorem:The variance of X can also be

calculated as follows:

222 XEXEXV )]([)()(

Standard Deviation of a random variable X

)(2 XV

Example: Suppose sales at a donut shop are distributed as below. Calculate (a) the mean number of donuts sold, (b) the variance (using both the definition of the variance & the theorem), & (c) the standard deviation.

x p(x)

1 0.08

2 0.27

4 0.10

6 0.33

12 0.22

First, the mean….

x p(x) xp(x)

1 0.08 0.08

2 0.27 0.54

4 0.10 0.40

6 0.33 1.98

12 0.22 2.64

x p(x) xp(x)

1 0.08 0.08

2 0.27 0.54

4 0.10 0.40

6 0.33 1.98

12 0.22 2.64

First, the mean….

Next, the variance using the definition:

x p(x) xp(x) x-

1 0.08 0.08 -4.64

2 0.27 0.54 -3.64

4 0.10 0.40 -1.64

6 0.33 1.98 0.36

12 0.22 2.64 6.36

)()(])[()( 222 xpXXEXVx

x p(x) xp(x) x- (x-

1 0.08 0.08 -4.64 21.53

2 0.27 0.54 -3.64 13.25

4 0.10 0.40 -1.64 2.69

6 0.33 1.98 0.36 0.13

12 0.22 2.64 6.36 40.45

)()(])[()( 222 xpXXEXVx

x p(x) xp(x) x- (x- (x-p(x)

1 0.08 0.08 -4.64 21.53 1.72

2 0.27 0.54 -3.64 13.25 3.58

4 0.10 0.40 -1.64 2.69 0.27

6 0.33 1.98 0.36 0.13 0.04

12 0.22 2.64 6.36 40.45 8.90

)()(])[()( 222 xpXXEXVx

x p(x) xp(x) x- (x- (x-p(x)

1 0.08 0.08 -4.64 21.53 1.72

2 0.27 0.54 -3.64 13.25 3.58

4 0.10 0.40 -1.64 2.69 0.27

6 0.33 1.98 0.36 0.13 0.04

12 0.22 2.64 6.36 40.45 8.90

=5.64 2 =14.51

)()(])[()( 222 xpXXEXVx

Now, the variance using the theorem:V(X) = E(X2)-[E(X)]2.

x p(x) xp(x) x- (x- (x-p(x) x2

1 0.08 0.08 -4.64 21.53 1.72 1

2 0.27 0.54 -3.64 13.25 3.58 4

4 0.10 0.40 -1.64 2.69 0.27 16

6 0.33 1.98 0.36 0.13 0.04 36

12 0.22 2.64 6.36 40.45 8.90 144

=5.64 2 =14.51

x p(x) xp(x) x- (x- (x-p(x) x2 x2p(x)

1 0.08 0.08 -4.64 21.53 1.72 1 0.08

2 0.27 0.54 -3.64 13.25 3.58 4 1.08

4 0.10 0.40 -1.64 2.69 0.27 16 1.60

6 0.33 1.98 0.36 0.13 0.04 36 11.88

12 0.22 2.64 6.36 40.45 8.90 144 31.68

=5.64 2 =14.51

1 0.08 0.08 -4.64 21.53 1.72 1 0.08

2 0.27 0.54 -3.64 13.25 3.58 4 1.08

4 0.10 0.40 -1.64 2.69 0.27 16 1.60

6 0.33 1.98 0.36 0.13 0.04 36 11.88

12 0.22 2.64 6.36 40.45 8.90 144 31.68

=5.64 2 =14.51 E(X2)=46.32

1 0.08 0.08 -4.64 21.53 1.72 1 0.08

2 0.27 0.54 -3.64 13.25 3.58 4 1.08

4 0.10 0.40 -1.64 2.69 0.27 16 1.60

6 0.33 1.98 0.36 0.13 0.04 36 11.88

12 0.22 2.64 6.36 40.45 8.90 144 31.68

=5.64 2 =14.51 E(X2)=46.32

2 = V(X) = E(X2) – [E(X)]2 = 46.32 – (5.64)2 = 14.51

And lastly, the standard deviation,by taking the square root of the variance.

1 0.08 0.08 -4.64 21.53 1.72 1 0.08

2 0.27 0.54 -3.64 13.25 3.58 4 1.08

4 0.10 0.40 -1.64 2.69 0.27 16 1.60

6 0.33 1.98 0.36 0.13 0.04 36 11.88

12 0.22 2.64 6.36 40.45 8.90 144 31.68

=5.64 2 =14.51 E(X2)=46.32

2 = V(X) = E(X2) – [E(X)]2 = 46.32 – (5.64)2 = 14.51 = 3.81

Important Theorem

If X has mean and variance 2, then (X-)/ has mean 0 and variance 1.

Example: (G-)/

Suppose your course grades have a mean of 2.7 and a standard deviation of 1.2.

Suppose you took your grades, subtracted 2.7 from each one, then divided those results by 1.2.

The new set of numbers would have a mean of 0 and a standard deviation of 1.

Expectation RulesLet k, a, & b be constants.

1. E(k) = k The mean of a constant is the constant.

2. V(k) = 0 The variance of a constant is zero.

3. E(a + bX) = a + b E(X)

4. V(a + bX) = b2 V(X)

Example: If X has a mean of 3 and a variance of 2/3, what are the mean and variance of Y=5+2X ?

First find the mean E(Y) = E(5+2X). E(a + bX) = a + b E(X).Let a=5 & b=2. Then just plug into the formula. So,E(Y) = E(5+2X) = 5 + 2 E(X) = 5 + 2(3) = 11.Next find the variance V(Y) = V(5+2X). V(a + bX) = b2 V(X).Again let a=5 and b=2 and just plug into the formula.V(Y) = V(5+2X) = 22 V(X) = 4 V(X) = 4(2/3) = 8/3.Notice that the constant term shifts the mean but has no

effect on the spread of the distribution.

Joint Probability Distribution for 2 Discrete Random Variables X & Y

p(x,y) = Pr(X=x and Y=y)

Properties of Joint Probability Distributions

y and x all for 1yxp0 1. ),(

1 y)p(x, 2.

Example: Consider the following joint distribution of the number of jobs & the number of promotions of college graduates in their 1st 5 years out of college.

Number of Promotions (y)

1 2 3 4

1 0.10 0.15 0.12 0.06

2 0.05 0.07 0.10 0.05

3 0.04 0.02 0.14 0.10Num

For example, the probability of 3 jobs & 2 promotions is 0.02.

1 2 3 4

1 0.10 0.15 0.12 0.06

2 0.05 0.07 0.10 0.05

3 0.04 0.02 0.14 0.10Num

We can determine the marginal distribution of the 2 random variables X & Y

just as we did before for 2 events.Just add across the row or down the column.

1 2 3 4

1 0.10 0.15 0.12 0.06

2 0.05 0.07 0.10 0.05

3 0.04 0.02 0.14 0.10

For the probability of 1 job…

Number of Promotions (y)pX(x):

marginal prob. of x

1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05

3 0.04 0.02 0.14 0.10

Similarly for the probabilities of 2 or 3 jobs …

marginal prob. of x

1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

For the probability of 1 promotion …

marginal prob. of x

1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

pY(y): marginal prob. of y

and for the probabilities of 2, 3, or 4 promotions …

marginal prob. of x

1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21

Notice again, that you must get at total one when you total the marginal probabilities for x and for y.

marginal prob. of x

1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

Conditional Probabilities for Random VariablesExample

The probability that X is 2 given that Y is 3:

pX|Y(2|3) = Pr(X=2|Y=3)

= Pr(X=2 & Y=3)/Pr(Y=3).

The probability that Y is 2 given that X is 3:

pY|X(2|3) = Pr(Y=2|X=3)

= Pr(Y=2 & X=3)/Pr(X=3).

Let’s do the calculations using our previous example.

marginal prob. of x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

pX|Y(2|3) = Pr(X=2|Y=3)

= Pr(X=2 & Y=3)/Pr(Y=3)

0.10/0.36 = 0.278.

pY|X(2|3) = Pr(Y=2|X=3)

= Pr(Y=2 & X=3)/Pr(X=3)

= 0.02/0.30 = 0.067.

Cumulative Joint Mass Function for 2 Discrete Random Variables X & Y

F(X,Y) = Pr(X ≤ x and Y ≤ y)

Job/Promotion Example: Find probability that a person had 2 or fewer jobs & 3 or fewer promotions

Number of Promotions (y) pX(x):

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

pY(y): marginal prob. of

0.19 0.24 0.36 0.21 1.00

F(2,3)

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) …

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) + f(1,2) …

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) + f(1,2) + f(1,3) …

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) + f(1,2) + f(1,3) …

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) + f(1,2) + f(1,3) + f(2,1) …

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) + f(1,2) + f(1,3) + f(2,1) + f(2,2) …

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) + f(1,2) + f(1,3) + f(2,1) + f(2,2) + f(2,3) …

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) + f(1,2) + f(1,3) + f(2,1) + f(2,2) + f(2,3)

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) + f(1,2) + f(1,3) + f(2,1) + f(2,2) + f(2,3)

= 0.10 + 0.15 + 0.12 + 0.05 + 0.07 + 0.10

marginal prob. of

x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

F(2,3) = f(1,1) + f(1,2) + f(1,3) + f(2,1) + f(2,2) + f(2,3)

= 0.10 + 0.15 + 0.12 + 0.05 + 0.07 + 0.10

= 0.59

Independence

Recall that 2 events A & B were independent if Pr(A∩B)=Pr(A) Pr(B)

Similarly 2 random variables are independent if p(x,y) = pX(x) pY(y) for all values of x & y

In our previous example, are the number of jobs & number of promotions independent?

marginal prob. of x1 2 3 4

1 0.10 0.15 0.12 0.06 0.43

2 0.05 0.07 0.10 0.05 0.27

3 0.04 0.02 0.14 0.10 0.30

0.19 0.24 0.36 0.21 1.00

We must have p(x,y) = pX(x) pY(y) for all values of x & y.

To start, does p(1,1) equal pX(1) pY(1) ?

p(1,1) = 0.10

pX(1) pY(1) = 0.43 • 0.19

= 0.0817

≠ 0.10

So X & Y are not independent.

If that case had been equal, we wouldn’t be done yet. We’d have to verify that equality held for all the cells.

Theorem: mean of a function of 2 random variables X & Y

yxpyxgYXgE ),(),()],([

Suppose that based on the joint distribution of the length X & width Y of lumber sold by a lumberyard, we would like to determine the

mean length, mean width, & mean area of the lumber.

So we want to calculate

E(Y), and

E(XY).

Given the joint distribution below, calculate E(X), E(Y), & E(XY).

X4 0.05 0.05 0.10

8 0.10 0.50 0.20

First, determine the marginal distributions.

X4 0.05 0.05 0.10

8 0.10 0.50 0.20

YpX(x)

X4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

The marginal distribution of X ...

The marginal distribution of Y ...

YpX(x)

X4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30

Check that the marginal distribution probabilities sum to 1.

YpX(x)

X4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

Next we calculate the mean length & mean width.

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

For E(X), remember we need to multiply the values by their probabilities

and add up.

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

x p(x) xp(x)

We get the values of X and their probabilities …

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

x p(x) xp(x)

4 0.20

8 0.80

multiply …

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

x p(x) xp(x)

4 0.20 0.80

8 0.80 6.40

and add up.

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

x p(x) xp(x)

4 0.20 0.80

8 0.80 6.40

We now have our E(X).

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

x p(x) xp(x)

4 0.20 0.80

8 0.80 6.40

E(X) = 7.20

For E(Y), we do the same thing.

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

y p(y) yp(y)

Get the values of Y and their probabilities …

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

y p(y) yp(y)

2 0.15

4 0.55

6 0.30

multiply …

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

y p(y) yp(y)

2 0.15 0.30

4 0.55 2.20

6 0.30 1.80

and add up.

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

y p(y) yp(y)

2 0.15 0.30

4 0.55 2.20

6 0.30 1.80

There’s our E(Y).

YpX(x)

4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

y p(y) yp(y)

2 0.15 0.30

4 0.55 2.20

6 0.30 1.80

E(Y) = 4.30

To calculate the mean area E(XY), we use the theorem

YpX(x)

X4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

yxpxyXYE ),( ][For the mean area, E(XY), the theorem translates to

YpX(x)

X4 0.05 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

yxp xyXYE ),(][

To keep track of the xy terms, we are going to put them in our table.

YpX(x)

X4 0.05 (8) 0.05 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

yxpxyXYE ),( ][

YpX(x)

X4 0.05 (8) 0.05 (16) 0.10 0.20

8 0.10 0.50 0.20 0.80

pY(y) 0.15 0.55 0.30 1.00

yxpxyXYE ),( ][

YpX(x)