Top Banner
1 ME 233 Advanced Control II Lecture 8 Discrete Time Linear Quadratic Gaussian (LQG) Optimal Control (ME233 Class Notes pp.LQG1-LQG7)
54

ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Mar 17, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

1

ME 233 Advanced Control II

Lecture 8

Discrete Time

Linear Quadratic Gaussian (LQG)

Optimal Control

(ME233 Class Notes pp.LQG1-LQG7)

Page 2: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

2

Outline

• Stochastic optimization

• Finite horizon LQG

– State feedback optimal LQG control

– Output feedback optimal LQG control

Page 3: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

3

Stochastic ControlLinear system contaminated by noise:

YX(zI-A )-1 CB

+ +U

W

VBw

Two random disturbances:

• Input noise w(k) - contaminates the state x(k)

• Measurement noise v(k) - contaminates the

output y(k)

Page 4: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

4

Stochastic state model

Where:

• available output

• control input

• Gaussian, uncorrelated, zero mean, input noise

• Gaussian, uncorrelated, zero mean, meas. noise

• Gaussian initial state

Page 5: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

5

Assumptions (same as for KF)

• Initial conditions:

• Noise properties:

Zero-mean

Gaussian

uncorrelated

noises

Page 6: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Some notation- control and measurements

The control sequence from k to N-1

6

The optimal control sequence from k to N-1

The output measurements up to k

Page 7: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

7

Finite-horizon LQG

For N > 0, find the optimal control sequence:

Which minimizes the cost functional:

where can only be based on the observations

Page 8: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

8

Separation Principle

Main Theorem:

The optimal control is given by:

Where:

• The feedback gain K(k) is obtained from the

deterministic LQR solution.

• The state estimate is the a-posteriori

Kalman Filter state estimate.

Page 9: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

9

Separation Principle

U Y

zB

YX

(zI-A )-1 CB

+ -

F(k)

A

+

-

X

o

Y~

X

C

o

-+

- K(k+1)

A-posteriori KF

Deterministic LQR

+

Page 10: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

10

Separation Principle Proof

The proof of the separation principle is conducted

in two steps:

1. Solve the LQG problem under the assumption

that the state vector is measurable

2. Solve the LQG problem and show that the

optimal solution is obtained by replacing

by the a-posteriori state estimate

Page 11: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

11

Finite-horizon state feedback LQG

This problem is similar to the standard

deterministic finite-horizon LQR…

…except that there is an additional input noise…

…and the control is only allowed to be a

function of

Page 12: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Functionality constraint on control

• The control u(k) is only allowed to be a

function of x(0),…,x(k)

• We write this constraint as

• We write the constraints

for k=m,…,N-1 as

12

Page 13: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

13

Finite-horizon state feedback LQG

We want to solve using dynamic programming:

Need 2 preliminary results:

1. Functional optimization

2. Stochastic Bellman equation

Page 14: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Functional optimization

Lemma 1:

Let X be a random vector and let denote

the constraint that u is a function of X

Also assume that there exists uo(x) such that

Then

14

Page 15: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Functional optimization

Proof is in 2 parts:

1.

2.

15

Page 16: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof:

Let uo(x) minimize f(x,u)

u is a function of X

Because

Page 17: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof:

• Let

• Minimizing the right-hand side over

completes the proof

u is a function of X

This holds, regardless of how was chosen

Page 18: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Definitions

• Terminal cost

• Stage cost (transient cost)

• Optimal cost to go

18

Page 19: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Stochastic Bellman equation

Lemma 2:

If for

Then

19

Page 20: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof: (m=N-1 case is trivial, and thus omitted)

Page 21: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

21

Finite-horizon state feedback LQG

Theorem 1:

a) The optimal control is given by

Standard deterministic LQR solution!

Page 22: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

22

Theorem 1:

b) The optimal cost Jo is given by

Finite-horizon state feedback LQG

Page 23: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

23

Theorem 1:

b) The optimal cost is given by

This term reflects the detrimental effect of w(k) on the cost

b(k) is a dynamic function of the noise intensity

b(k) is computed backwards in time with b(N) = 0

Finite-horizon state feedback LQG

Page 24: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

24

Theorem 1:

b) The optimal cost is given by

Finite-horizon state feedback LQG

Deterministic LQR

cost associated

with mean of x(0)

Detrimental effect of

randomness of x(0)

on the cost

Detrimental effect

of w(0),…,w(k)

on the cost

Page 25: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof consists of 2 steps:

1. Prove

and using induction

on decreasing m, Lemma 1, and the stochastic

Bellman equation (Lemma 2)

2. Prove

25

Finite-horizon state feedback LQG

Page 26: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof of Theorem 1: and .

Start with base case: m=N

26

Page 27: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof of Theorem 1: and .

For m=0,1,…,N-1:

(We use induction on decreasing m)

27

By the induction hypothesis,

Term 1

Term 2

Term 3

Page 28: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof of Theorem 1: and . 28

Term 2

Since x(m) and u(m) only depend on quantities

that are independent from w(m)

Ax(m) + Bu(m) is independent from w(m)

0

Page 29: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof of Theorem 1: and . 29

Term 3

Page 30: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof of Theorem 1: and .

Therefore

30

Page 31: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof of Theorem 1: and . 31

Now use stochastic Bellman equation

Page 32: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof of Theorem 1: and . 32

• Use Lemma 1 to exchange min and E

• b(m) does not depend on u(m)

Page 33: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof of Theorem 1: and . 33

This is the same optimization we

solved for deterministic LQR!

Optimal value:

Page 34: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Proof consists of 2 steps:

1. Prove

and using induction

on decreasing m, Lemma 1, and the stochastic

Bellman equation (Lemma 2)

2. Prove

34

Finite-horizon state feedback LQG

Page 35: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

35

Proof:

0

Page 36: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

36

Proof: (cont’d)

Page 37: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

37

Separation Principle Proof

The proof of the separation principle is conducted

in two steps:

1. Solve the LQG problem under the assumption

that the state vector is measurable

2. Solve the LQG problem and show that the

optimal solution is obtained by replacing

by the a-posteriori state estimate

Page 38: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

38

Finite-horizon LQG

This problem is similar to the standard

deterministic finite-horizon LQR…

…except that there is an additional input noise…

…and the control is only allowed to be a

function of

Page 39: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Functionality constraint on control

• The control u(k) is only allowed to be a

function of y(0),…,y(k)

• As before, we write this constraint as

• As before, we write the constraints

for k=m,…,N-1 as

39

Page 40: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

40

Finite-horizon LQG

We want to solve:

We will relate this to an optimal

state feedback LQG control problem

For simplicity, assume S = 0

Page 41: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Reformulation of LQG

• Examine

41

0 (by LS property 1)

Page 42: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Reformulation of LQG

• Therefore,

• Similarly,

• Want to apply these identities to LQG

42

(Recall that we assumed S = 0)

Page 43: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Reformulation of LQG43

Page 44: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Reformulation of LQG44

Terms

minimized by

the Kalman

filter

We will show that this corresponds to a

state feedback LQG control problem

Page 45: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

• From the Kalman filter :

• Recall that is uncorrelated and

Reformulation of LQG45

Page 46: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Reformulation of LQG46

Initial conditions:

Notate this as

Page 47: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Reformulation of LQG47

Initial conditions:

Correlation of with :

Page 48: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Reformulation of LQG48

Want to solve:

u(k) is a function of

u(k) is a function of

(because are functions of )

u(k) is a function of

(because , i.e. knowledge of does not

give any “information” about by LS property 1 )

Page 49: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

Reformulation of LQG49

Want to solve:

u(k) is a function of

This is a state feedback LQG control problem!

Apply results from first half of lecture

Uncorrelated with

Page 50: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

50

Optimal finite-horizon LQG, S=0

Main Theorem:

a) The optimal control is given by

Standard deterministic LQR solution!

Page 51: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

51

Optimal finite-horizon LQG, S=0

A-posteriori state observer structure:

Main Theorem:

Page 52: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

52

Optimal finite-horizon LQG, S=0Main Theorem:

b) The optimal cost is given by

Page 53: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

State space form of LQG controller53

Kalman

filter

LQR

Plugging this expression for uo(k) into the expression for

yields the state space model on the next slide

Eliminating from the expression for uo(k) yields

Page 54: ME 233 ReviewSolve the LQG problem under the assumption that the state vector is measurable 2. Solve the LQG problem and show that the optimal solution is obtained by replacing by

State space form of LQG controller54

where

K(k+1) is the standard deterministic LQR gain

F(k) and L(k) are the standard Kalman filter gains