1 Control of Large Scale Systems Jari Hätönen *,** April 2, 2003 * Department of Automatic Control and Systems Engineering, University of Sheffield, UK.

1

Control of Large Scale Systems

Jari Hätönen*,**

April 2, 2003*Department of Automatic Control and Systems Engineering,

University of Sheffield, UK

**Systems Engineering Laboratory

University of Oulu,

Finland

2

Introduction

The design of large (complex) systems commonly requires the division of the large system into smaller subsystems

For each subsystem it is necessary to define the subsystem model, the objective of the subsystem, and the constraints present in the subsystem

3

Introduction

The overall structure resulting from the interconnections of the subsystems can be very complex – in this talk only Two-level hierarchical systems are considered

In hierarchical systems each subsystem has its own decision unit and control unit

The decision unit and the control unit are responsible for making the subsystem to achieve its objectives

4

A Two-Level Hierarchical System

Coordinator

1 2 N

Upp

er le

vel

Low

er le

vel

Lower-leveldecision making

Processlevel1 2 N…

…

5

Introduction

The hierarchical system theory has a strong connection with organisational theory!!!

Also connections with economical models can be found, i.e. a market driven economy can be considered as a two-level hierarchical systems where the prices of the products are the coordination variables determined by the government of pure competition.

6

Introduction

The degree of interconnectedness (the more interconnected, the more difficult it is to obtain overall balance) is highly dependent on system design.

For example in chemical unit processes buffer tanks can be used to cut the physical interconnection between two units (resulting in higher cost)

7

Why hierarchical systems are so important and common?

The system design is easier to control (module thinking)

Subsystem allow specialisation, i.e. each subsystem is only responsible for its own task and does not require information how the overall system works.

Maybe “evolution” also encourages hierarchical systems (i.e. the brain, pre-historic tribes etc).

8

Why hierarchical systems are so important and common?

They allow a certain degree of fault tolerance, i.e. if a sub-system breaks down, it can be easily replaced.

However, the coordinator is the weak point, i.e. if it stops working, the system stops functioning.

Interesting implications to warfare (i.e. Hussein and his closest allies were the first ones to be attacked).

9

Hierarchical systems and dynamics at different time-scales

Large plants have typically subsystems that have dynamics at different scales (i.e. in a paper machine the paper quality is kept fixed for a week, but the paper machine dynamics excited by disturbances have dynamics of few seconds).

Consequently it is natural to take the slow dynamics as the upper-level and the fast dynamics as the lower-level.

10

Hierarchical systems and dynamics at different time-scales

Coordination variables can selected to be for example the constant set-points for the lower-level decision units, that classically are PID-controllers.

The coordination variables (constant set-points) can be selected to be a solution of suitable (static) optimisation problem.

11

PROCESS MODEL

Coordinator

1 2 N

Upp

er le

vel

Low

er le

vel



…

12

Mathematical preliminaries

For each subsystem there exists a mapping

iii OIf :

where the triplet (Ii,Oi,fi) defines the input-outputmodel for subsystem i

The input and output domains are further divided into

iii

iii

ZYO

OMI

13

Mathematical preliminaries

The set Mi are the free inputs of the systems and Xi are the interconnected input, i.e the set Xi is determined by the behaviour of other subsystems

The Zi is set of interconnected outputs, i.e. they are used as inputs in other subsystems. The set Yi is the set of free outputs.

14

Free and interconnected variables

fiii Ii ii Oo

fi

ii Mm ii Yy

ii Xx ii Zz

15

An example

f1 f2 f3

m1 m2m3 y3

z11 x21 z31

y2y1

z22

z21 x31

x11

x12

z12

x32

16

A general two-level hierarchical system

fm y

C

zx

)(

),(),(

zCx

xmfzy

17


Furthermore, it can be shown that

N

ijiji zCx

1

where each Cij is a matrix where each elementis either zero or one (a connection matrix)

18


f

Czx

y y

z

x

m F

For mathematical tractability it has to assumed that there exists

))(),(),(()(

),,()(:

mSmKmPmF

zxymFmF

19

Comments on the overall model F

The whole point is that in practise it can be impossible (or impractical) to form explicitly F because it is implicitly defined by the constraint z=C(x) .

This is especially true if the number of subsystems is large or the subsystem models are complex.

20

DECISION UNITS

Coordinator

1 2 N

Upp

er le

vel

Low

er le

vel



…

21

Decision units

For each subsystem i there is a decision unit, whose objective is to control the subsystem according to its own objectives by manipulating the input variables mi.

In this talk it is assumed that the objective of the subsystem is to minimise a real-valued cost function.

The upper level decision unit tries to affect the lower level decision units so that the overall cost function would be minimised, which in this talk is the sum of individual cost functions.

22

The cost functions for lower-level units

More precisely, each subunit attempt to minimise the cost function

iiiii ZYXMg :

or equivalently by using the subsystem model

iii XMG :

where)),(),,(,,(),( 22

iiiiiiiiiiii xmfxmfxmgxmG

23

COORDNINATION

Coordinator

1 2 N

Low

er le

vel



…

24

The cost function for the upper-level

In a similar fashion there exists an overall cost function

ZYXMg :

Using the overall process model this can be equivalently written as

))(),(),(,()( mSmPmKmgmG

and in this talk it is assumed that (is this alwaysthe best choice?)

N

iiii mKmGmG

1

))(,()(

25

The upper-level decision problem

The objective of the coordinator is to affect the lower-level decision making so that the overall cost function G is minimised.

This optimisation problem can be equivalently written as a constrained optimisation problem

N

iiii

mKxFDm

xmG1)(

)(),(min

26

The upper-level decision problem

The construction of the overall optimisation problem G requires the overall system model F, but F is not explicitly available.

Consequently the coordinator cannot check using G if the system has reached its objectives.

Idea: modify the overall optimisation problem so that it can be divided into independent sub-problems, and the coordinator can manipulate the lower-level decision making so that the overall optimality would be achieved.

27

Modification

Let’s define new modified system descriptions

iiiii

iiiii

ZYXMg

ZYXMf

:~:

~

where is an external coordination variable

In a similar fashion let

),,(~

),,,(~

,,(~),,(~

,21 iiiiiiiiiiii xmfxmfxmgxmG

28

The modified sub-system decision process

The decision unit i has to control the sub-system i so that ),,(

~ iii xmG is being minimisedwith a fixed γ

If the solution exists it is called the γ-optimal solution (m(γ),x(γ))

The objective of the coordinator is to find a γ so that the overall cost function is minimised

))(()(min)(

mGmGFDm

29

How the modification should be done?

Whether or not the overall objective is achieved depends on how the modification is done. One straightforward possibility is to select

))(,()),(,(~

))(,()),(,(~

mKmGmKmG

mKmfmKmf

iiiiii

iiiiii

In other words the modified cost function is equal to theoriginal cost function if the interconnection equation x=K(m)is satisfied.

30

Coordinability

Using the modification in the previous slide coordinability can be defined as:

1. The overall optimisation problem has a solution

2. For each γ the the sub-system optimisation problem has a solution, i.e.

3. There exists (at least one) so that

)()(min)(

mGmGFDm

)),(),((~

),,(~

min)(),(

iiiiifDxm

xmGxmGiii

))(()( mGmG

31

Coordination

In practise it is impossible to know immediately the correct coordination parameter

An iterative process is needed where the coordination parameter is updated so that improvement in the overall objective is achieved.

In this case the coordinator needs a coordination strategy which tells how to update

32

Coordination algorithm

1. An initial guess is made for

2. Sub-system decision units solve their optimisation problems, resulting in (m(γ),x(γ))

3. If γ gives the optimal solution, stop. Otherwise update γ the following way:

))(),(,( xmand go to Step 2

33

Coordination algorithm

Select γ

Pro

cess

leve

lC

oord

inat

orS

ub-s

yste

mde

cisi

on u

nit

Solve m(γ),x(γ)

m(γ)

m(γ)

))(),(,( xmNo

Yes

Is optimality achieved

34

Initial thoughts on decomposition

As was defined earlier, the overall optimisation problem can be written as

N

iiii

mKxFDm

xmG1)(

)(),(min

This cost function is separable in the sense that each term Gi(mi,xi) contains only variables from the sub-system i

However, the constraint equation x=K(m) makes the variables dependent, and the problem is not decomposable.

35

The balancing principle

In the balance principle the interactions are removed in order to get a truly decomposable system

The sub-system optimisation is done as a function of mi and xi

As a result the optimal control policy (m,x) does not satisfy the constrain x=K(m) and “balancing” is needed.

36


The sub-system “variables” are modified in the following way:

)),,(),,(,,(~:),,(~

),,(),,,(:),,,,(~),(:),,(

~

21

iiiiiiiiiiii

iiiiiiiiiiiii

iiiiii

xmfxmfxmgxmG

zxzyxmgzyxmg

xmfxmf

where

N

iiii zxzx

1

0),,(:),,(

if and only if (the balance condition)

N

ijiji zCx

1

37


In the balance principle only the cost function is modified and the sub-system model fi remains the same.

The cost function modification is called the zero-sum modification because if the system is in balance, the effect of the modification disappears and the overall performance is just

N

iiii xmGmG

1

),(~

)(

38

Sub-system decision process with balancing

For each subsystem i and given γ find optimal pair (m(γ),x(γ)) so that

)),(),((),,(~

min)(),(

iiiiiifDxm

xmGxmGiii

Define now

),,(~

min)()(),(

iiifDxm

i xmGiii

39

Coordination in balancing

The modified overall cost function can be written as

N

iiiiii

N

iiiiiiiiii

N

iiii

xmfx

xmfxmfxmg

xmG

1

2

1

21

1

)),,(,(

)),(),,(,,(

),,(~

40


Suppose that the overall optimisation problem has a solution and there exits a so that the solution is in balance, i.e.

m ))(),(( xm

))(()( mKx

then (the proof is trivial due the zero-summodification)

N

ii

mG

1

))(()(

41


On the other hand for it is true that

))((

)(),(min

),,(~

min

),,(~

min)(

1)()(),(

1)()(),(

1)(),(

mG

mGxmG

xmG

xmG

N

iiii

mKxfDxm

N

iiii

mKxfDxm

ii

N

ii

fDxm

42


Consequently for all

)()( and the optimisation problem for the coordinatorbecomes

)(max

In practise it often impossible to solve the maximisation problem explicitly – the best one can do is to resort to gradient search

43


In summary1. Set k=0

2. Make an initial guess

3. Solve the sub-system optimisation problems with

4. If the system is balance, stop. If not calculate the gradient of and calculate

γ0=γk

)( k)(11 kkkk

and set k=k+1 and go to 3.

γk

)( k

44

Some remarks on balancing and prediction

During the iterative process the algorithm gives values for γ that do not result in balance – if the balance equations describe for example flows in a chemical process, the inputs the algorithm gives during iterations cannot be used because they do not fulfil physical constraints: not suitable for on-line applications!!!

An alternative method called the prediction method will always give a γ that satisfies the constraints. However, it has its own weaknesses and is rarely used in “real life”.

Hybrid methods exists that mix ingredients from the balance method and predictive method.

45

Further remarks on balancing

Already in 1970s it was suggested that the balancing principle could be solved by using a bargaining process.

Preliminary convergence analysis was done by resorting to game theory.

This can be seen as the first attempt to define agents…

46

Balancing wiyth Langrange techniques

Consider now the more general optimisation problem )(min

0)(

vQvR

VSv

The original optimisation problem is recovered if )()(and,

1

mKxvRGQXMVN

ii

where WVRVQ :,: and V,W areare real Banach spaces

47

The Langrange function

Define the Langrange function

)(,)(),( ** vRwvQwvL

where w* is the Langrange multiplier and belongsto the dual space W* of W.

It is easy to show that if there exists **ˆ,ˆ WwVv so that

)ˆ,ˆ()ˆ,(min ** wvLwvLSv

and 0)ˆ( vR then v̂ is the minimising solution

48

The saddle point theorem

Suppose the Langrange function has a saddle point so that**)ˆ,ˆ( WSwv **),( WSwv

)ˆ,()ˆ,ˆ(),ˆ( *** wvLwvLwvL

then 0)(ˆ 1 vRVvSv and

)ˆ()(min1

vQvQSSv

Proof. Omitted

49

The dual function

Consider now the dual function

),(min)( ** wvLwSv

Properties of the dual function

1. is concave and bounded (requires some additional assumptions)

2. It can be shown that

)()(

)0)((wand*

1**

1

vQw

vRVvSWSSv

50

The maximisation theorem

If is a saddle point of the Langrange function L, then

)ˆ()ˆ()(max **

**vQww

Ww

**)ˆ,ˆ( WSwv

Proof. Omitted.

51

Cost function modification with Balancing/Langrange

The overall cost function is modified to be

N

i

N

jjjjijii

N

iiii

xmfCx

xmGxmL

1 1

2

1

),(,

),(),,(

Changing the summation order it can be shown that

N

jiiiijj

iiiiiiii

N

iiii

xmfC

xxmGxmL

xmLxmL

1

2

1

),(,

,),(:),,(

),(),,(

52

Cost function modification with Balancing/Langrange

In other words each term Li depends only on the variables related to the sub-system i plus the Langrange multiplier.

This is modification is a zero-sum modification because when the constraint

N

jjjjiji xmfCx

1

2 ),(

are met, the effect of the Langrange modifier on the overall cost function disappears.

53

A toy example

Consider the system

),(2

),(2

222222

111111

umPumy

umPumy

with a coupling 1221 , mumu The cost function is

22

21

22

2 )2()1(),(1

yymmymG

54

A toy example

It takes two minutes to show that the optimal input is )

10

17,

5

1(ˆ m

The Langrange modified cost function becomes

)()(

)2()1(),(

122211

22

21

22

2

1

mumu

yymmymG

55

A toy example

This results in the decomposed cost functions

21222

222

12112

12

)2(),(

)1(),(

2

11

muymymG

muymymG

The optimal control actions become

2

1

2

2

1

2

1

1

4

8

24

410,

2

4

24

410

u

m

u

m

56

A toy example

The gradient of is just )(

12

21)(mu

mu

and the update law becomes (coordination algorithm)

12

2111

)(

)(

)1(

)1(

mu

muk

k

k

k

k

where k=0.1 (a sophisticated guess)

57

A toy example

It can be shown that the optimal

)5

4,

5

1(ˆ

In the following material the results from simulation of the balance method with Langrange techniques are given.

The initial guess in the simulations is

)10,10()0(ˆ

58

A toy example

59

A toy example

60

A toy example

61

Conclusions

A two-level hierarchical systems theory offers a very general method to optimise the running of hierarchical systems.

A can be applied on a wide range of applications, examples being economics, organisation theory and large-scale processing plants.

62

Conclusions

The theory can be seen as a starting point for decentralised control.

The main idea is to divide the system into “specialised” subsystems and optimise independently the running of these sub-systems.

In order to have harmony a coordinator is needed that affects the decision making of lower-level decision systems so that an overall balance (harmony, satisfaction level etc.) is achieved.

63

Conclusions

In this talk the balancing method was analysed in detail (not suitable for on-line applications)

The balancing was implemented using the well-known Langrange principle.

Unfortunately, the theory is quite mathematical, and utilises the theory of constrained optimisation in general Banach spaces.

Hard for engineers to digest – low success in the industry

64

Conclusions

Also other numerical approaches exists for large-scale systems: a typical example is a transportation problem where the special structure of the problem is utilised so that efficient numerical methods can be found to solve the optimisation problem.

1 Control of Large Scale Systems Jari Hätönen *,** April 2, 2003 * Department of Automatic Control and Systems Engineering, University of Sheffield, UK.

Documents

twolevel hierarchical

system design

large system

overall system

fast dynamics

slow dynamics

subsystem model

control unitthe decision