particle flow for nonlinear filters

Post on 26-Dec-2021

5 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

Transcript

particle flow for

nonlinear filters

Fred Daum

19 June 2012

Copyright © 2012 Raytheon Company. All rights reserved.

Customer Success Is Our Mission is a trademark of Raytheon Company.

1

discrete time nonlinear filter problem

k

k

k

kkkk

z

w

tx

wttxFtx

tat timet vector measuremen

tat time vector noise process

tat time vectorstate)(

),),(()(

k

k

k

1

=

=

=

=+estimate x

given noisy

measurements

k

k

kk

k

kkkk

k

zzz

Z

Ztxp

v

vttxHz

,...,,Z

tsmeasuremen all ofset

given Z tat time x ofdensity y probabilit),(

tat time vector noiset measuremen

),),((

21k

kk

k

k

=

=

=

=

=

2

many applications of filtering & prediction

roboticsnavigation communicationscontrol of chemical,

mechanical, electrical & nuclear plants

guidance

tracking missiles, satellites, aircraft, ground vehicles,

artillery shells, comets, people, cows, salmon

weather & climate prediction

predicting ionosphere, thermosphere, troposphere

fancy signal processing

science imagingmedicine (e.g., MRI, surgery, pace makers,

drug design, diagnosis)

fancy signal processing

(e.g., MIMO radar, MIMO sonar, MIMO comm, MIMO nav)

oil & mineral exploration

financial engineering adaptive antennasaudio & video signal

processing

Bayesian decisions multi-sensor data fusion compressive sensing other….

3

type of

nonlinear

filter

statistics computed computational

complexity

estimation

accuracy

representation

of probability

density

extended

Kalman filters

mean vector &

covariance matrix d³

sometimes good

but often highly

suboptimal

mean vector &

covariance matrix

unscented

Kalman filters

mean vector &

covariance matrix d³

sometimes better

than EKF but

sometimes worse

mean vector &

covariance matrix

batch least

squares

mean vector &

covariance matrix d³

sometimes better

than EKF but

sometimes worse

mean vector &

covariance matrixd³sometimes worse

numerical

solution of

Fokker-Planck

PDE

full conditional

probability density of

statecurse of

dimensionality

optimal* points in state space

and/or smooth

functions

particle filters

full conditional

probability density of

state

curse of

dimensionality

optimal* particles

exact recursive

filters (Kalman,

Beneš, Daum,

Wonham, Yau)

full conditional

probability density of

state

polynomial in d

(for special

problems)

optimal

(for special

problems)

sufficient

statistics

4

dimension of

the state vector

extended

Kalman

filters

particle flow

filters

10

100

5

nonlinearity or non-Gaussianity

standard particle filters

filters

1

10

exact flow filter is many orders of magnitude faster per

particle than standard particle filters

- - - - -

bootstrap

EKF proposal

incomp flow

exact flow

3

104

105

106

107

Med

ian

Co

mp

uta

tio

n T

ime

fo

r 30 U

pd

ate

s (

sec)

d = 30

d = 20

d = 10

d = 5 bootstrap

particle filter

EKF proposal

* Intel Corel 2 CPU, 1.86GHz, 0.98GB of RAM, PC-MATLAB version 7.7

25 Monte Carlo trials10

210

310

410

510

-1

100

101

102

103

Number of Particles

Med

ian

Co

mp

uta

tio

n T

ime

fo

r 30 U

pd

ate

s (

sec)

6

exact flow

EKF proposal

incompressible

flow

particle flow filter is many orders of magnitude faster

real time computation (for the same or better

estimation accuracy)

3 or 4 orders of

3 or 4 orders of magnitude faster

per particle

avoids bottleneck in

many orders of

magnitude faster

3 or 4 orders of magnitude

fewer particles

bottleneck in parallel

processing due to resampling

7

10-1

100

101

102

EKF

PF Incompressible

new filter improves accuracy by

two orders of magnitude

median error

N = 500 particles

extended Kalman filter

standard particle filter

0 2 4 6 8 1010

-3

10-2

10

Time (sec)

PF Incompressible

PF Ax+BN = 500 particles

new filter

key idea: small curvature flow

(inspired by fluid dynamics) to make

solution of PDE div(pf) = η much faster

Euler’s equations:

3121233

2313122

1232311

)(

)(

)(

MIII

MIII

MIII

=−+

=−+

=−+

ωωω

ωωω

ωωω

&

&

&

8

Why engineers like particle filters:

• Very easy to code

• Extremely general dynamics & measurements: nonlinear & non-Gaussian

• Optimal estimation accuracy (if you use enough • Optimal estimation accuracy (if you use enough particles….)

• You don’t need to know anything about stochastic differential equations or any fancy numerical methods for solving PDEs

• Some people (erroneously) think that PFs beat the curse of dimensionality

9

curse of dimensionality for

classic particle filter

optimal

accuracy:

r = 1.0

10

prediction of

conditional

probability

density from

tk-1 to tk

nonlinear filter

measurements

:rule Bayes'

solution of

Fokker-Planck

equation

),(),(),(

:rule Bayes'

1 kkkkkk txzpZtxpZtxp −=

11

particle degeneracy*

likelihood of

measurementprior

density

particles to represent the prior

12

*Daum & Huang, “Particle degeneracy: root cause & solution,” SPIE Proceedings 2011.

chicken & egg problem

How do you pick a

good way to represent

the product of two

functions before you functions before you

compute the product

itself?

13

induced flow of particles

for Bayes’ rule

pdf pdfflow of density

prior posterior

)(log)(log),(log xhxgxp λλ +=

particles particles

flow of particles

sample from

density

sample from

density

λ=0 λ=1

)(log)(log),(log xhxgxp λλ +=

),( λλ

xfd

dx=

14

derivation of PDE for exact particle flow:

∂−=

∂−=

=

λλ

λ

λ

λ

λλ

x

pfTrxp

xp

x

pfTr

xp

xfd

dx

)(),(

),(log

)(),(

),( Fokker-Planck

equation with zero

diffusion

definition

of p(x,λ)

−−=

=

−=

∂−

−+=

∂∂

λ

λλη

η

λλ

λ

λλλ

λ

d

Kdxhxp

pfdiv

pfdivxpK

xh

Kxhxgxp

x

)(log)(log),(

)(

)(),()(log

)(log

)(log)(log)(log),(log

15

definition

of η

PDE for

f given p

linear first order highly underdetermined PDE:

d

d

x

q

x

q

x

q

d

Kdxhxpx

xxqdiv

∂++

∂+

∂=

−−=

=

...

)(log)(log),(),(

),()),((

2

2

1

λ

λλλη

ληλ

like Gauss’ divergence

law in electromagneticsdxxx ∂∂∂ 21

function values are

only known at

random points in d-

dimensional space

16

q = pf

f = unknown function

p & η = known at random points

We want dx/dλ = f(x,λ)

to be a stable dynamical

system.

law in electromagnetics

likelihood of

measurementprior

densityoptimal

accuracyg h

root cause of

curse of dimensionality:

curse of dimensionality:

particles to represent the prior

pdf pdf

particles particles

flow of density

flow of particles

sample from

density

sample from

density

λ=0 λ=1

prior posterior

)(log)(log),(log xhxgxp λλ +=

),( λλ

xfd

dx=

f.for PDE above thesolving

by flow particle design the We

loglog)(

+−=

λd

Kdhppfdiv

17

method to solve PDE how to pick unique solution comments

1. generalized inverse of linear differential operator minimum norm* very difficult to design robust stable & fast algorithm

2. Poisson’s equation gradient of potential*(assume irrotational flow)

very difficult to design robust stable & fast algorithm

3. generalized inverse of gradient of log-homotopy assume incompressible flow & pick minimum L² norm solution

workhorse for multimodal densities

4. generalization of method #3 most robustly stable filter or random pick, etc.

workhorse for multimodal densities

5. separation of variables (Gaussian) pick solution of specific form (polynomial) extremely fast & hard to beat in accuracy for many problems

6. separation of variables

(exponential family)

pick solution of specific form (finite basis functions)

needs theoretical work & numerical experiments

7. variational formulation (Gauss & Hertz) convex function minimization needs work

8. optimal control formulation convex functional minimization (e.g., least action like Monge-Kantorovich)

very high computational complexityaction like Monge-Kantorovich)

9. direct integration (of first order linear PDE in divergence form)

choice of d-1 arbitrary functions should work with enforcement of neutral charge density & importance sampling

10. generalized method of characteristics more conditions (e.g., small curvature or specify curl, or use Lorentz invariance)

needs theoretical work & numerical experiments

11. another homotopy (inspired by Gromov’s

h-principle) like Feynman’s QED perturbation

initial condition of ODE &

uniqueness of sol. to ODE

needs theoretical work & numerical experiments

12. finite dimensional parametric flow

(e.g., f = Ax+b with A & b parameters)

non-singular matrix to invert needs numerical experiments

13. Fourier transform of PDE (divergence form of linear PDE has constant coefficients!)

minimum norm* or most stable flow very difficult to design robust stable & fast algorithm

14. small curvature flow & assumed prior density solve d x d system of linear equations new in 2012. Beats other methods for difficult nonlinear problems.

15. small curvature flow & homotopy for inverse of A + B (sum of two linear operators)

numerically integrate ODE new in 2012. extremely cool theory.

16. small curvature flow & homotopy for generalized inverse of A + B (sum of two linear operators)

numerically integrate ODE new in 2012

exact particle flow for Gaussian densities:

fx

pfdiv

d

Kdh

xfd

dx

∂−−=−

=

:exactly ffor solvecan weGaussian,h & gfor

log)(

)(log)log(

),(

λ

λ

λλ

[ ]

( ) ( )[ ]xAzRPHAIAIb

HRHPHPHA

bAxf

T

TT

+++=

+−=

+=

1

1

2

2

1

:exactly ffor solvecan weGaussian,h & gfor

λλ

λ

19

automatically stable

under very mild

conditions &

extremely fast

incompressible particle flow

−=

gradient zero-nonfor

),(log

))(log(dx

2

x

xp

xh

T

λ

λ

∂−

=

otherwise 0

gradient zero-nonfor ),(log

))(log(

d

dx 2

x

xpxh

λλ

20

irrotational particle flow:

∫ −

=

≥−

−=

=

∂==

d

T

dyc

dyyx

cyxV

xx

xVTr

xpx

xVxf

d

dx

)logK(

-logh(y))p(y,)V(x,

3 dfor ),(),(

),(),(

),(/),(

),(

2

2

2

λλλ

ληλ

ληλ

λλ

λλ Poisson’s

equation

−−

∂−≈

−−

∂−=

−−

∂−=

− ∂

=

iSjd

ji

T

ji

ji

d

T

d

T

d

xx

xxdcKxh

Mx

xV

yx

yxdcKyhE

x

xV

dyyx

yxdcKyhyp

x

xV

dyyx

))(2()

)(log)((log

1),(

))(2()

)(log)((log

),(

))(2()(log)(log),(

),(

-logh(y))p(y,)V(x,2

λ

λλ

λ

λλ

λ

λλ

λ

λλλ

21

like

Coulomb’s

law

0 5 10 15 20 25 30 35 400

100

200

300N = 1000

An

gle

Err

or

(de

g)

EKF

PF

0 5 10 15 20 25 30 35 400

5

10

15

Time

An

gle

Ra

te E

rro

r (d

eg

/se

c)

EKF

PF

22

100

200

300

400Time = 1, Frame 1

An

gle

Ra

te (

de

g/s

ec)

initial probability distribution of particles:

λ = 0.0

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

23

100

200

300

400Time = 1, Frame 2

An

gle

Ra

te (

de

g/s

ec)

λ = 0.1

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

24

100

200

300

400Time = 1, Frame 2

An

gle

Ra

te (

de

g/s

ec)

λ = 0.2

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

25

100

200

300

400Time = 1, Frame 3

An

gle

Ra

te (

de

g/s

ec)

λ = 0.3

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

26

100

200

300

400Time = 1, Frame 4

An

gle

Ra

te (

de

g/s

ec)

λ = 0.4

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

27

100

200

300

400Time = 1, Frame 5

An

gle

Ra

te (

de

g/s

ec)

λ = 0.5

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

28

100

200

300

400Time = 1, Frame 6

An

gle

Ra

te (

de

g/s

ec)

λ = 0.6

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

29

100

200

300

400Time = 1, Frame 7

An

gle

Ra

te (

de

g/s

ec)

λ = 0.7

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

30

100

200

300

400Time = 1, Frame 8

An

gle

Ra

te (

de

g/s

ec)

λ = 0.8

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

31

100

200

300

400Time = 1, Frame 9

An

gle

Ra

te (

de

g/s

ec)

λ = 0.9

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

32

100

200

300

400Time = 1, Frame 10

An

gle

Ra

te (

de

g/s

ec)

final probability distribution of particles (resulting from

one noisy measurement of sin(θ) with Bayes’ rule):

λ = 1

-200 -150 -100 -50 0 50 100 150 200-300

-200

-100

0

Angle (deg)

An

gle

Ra

te (

de

g/s

ec)

33

induced flow of particles

for Bayes’ rule

pdf pdfflow of density

prior posterior

)(log)(log),(log xhxgxp λλ +=

particles particles

flow of particles

sample from

density

sample from

density

λ=0 λ=1

)(log)(log),(log xhxgxp λλ +=

),( λλ

xfd

dx=

34

103

104

d = 12, ny = 3, y = x2, SNR = 20dB

Dim

en

sio

nle

ss E

rro

r

extended Kalman filter (EKF)

particle filter beats the

EKF by two orders of

magnitude in accuracy

102

103

104

105

102

Dim

en

sio

nle

ss E

rro

r

Number of Particles

EKF

PF

quadratic measurement

nonlinearity 35

particle filter

magnitude in accuracy

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

36

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

37

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

38

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

39

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

40

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

41

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

42

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

43

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

44

0

0.2

0.4

0.6

0.8

1x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

x 105

-1

-0.8

-0.6

-0.4

-0.2

45

105

106

107

d = 12, ny = 3, y = x

3, SNR = 20dB

Dim

en

sio

nle

ss

Err

or

particle filter beats the

EKF by two orders of

magnitude in accuracy

extended Kalman filter (EKF)

102

103

104

105

103

104

Dim

en

sio

nle

ss

Err

or

Number of Particles

EKF

PF

46

magnitude in accuracy

particle filter

cubic measurement

nonlinearity

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

47

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

48

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

49

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

50

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

51

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

52

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

53

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

54

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

55

0

0.5

1

1.5x 10

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5 2

x 105

-1.5

-1

-0.5

56

direct integration of PDE

∂++

∂+

∂=

=

d

d

x

q

x

q

x

q

xxqdiv

...

),()),((

:PDE theof form divergence theuse

2

2

1

ληλ

∫ −+=

=

=

==

∂∂∂

jx

j

d

dxxx

q

xxx

),(~),(q~q

q~ q ofcomponent onebut allpick

~-)q~-div(q

~)q~div( problem related ofsolution exact ~

jj

21

ληλη

ηη

η

57

most general solution for exact flow:

)(log-Cf

:issolution generalmost the

log)()log(

),(

## −+=

∂−−=

=

yCCIh

fx

pfdivh

xfd

dxλ

λ

fundamental

PDE for exact

particle flow

vectorldimensiona-darbitrary y

C of inverse dgeneralize

x

logpC

:operator aldifferentilinear a is Cin which

)(log-Cf

#

##

=

=

+∂

∂=

−+=

C

div

yCCIh

could pick y to

robustly stabilize the

filter or random or

other

58

computing the generalized inverse of A + B:

#

#

#

0)(

BA)C( :homotopy a define

case) in this isit but truegenerally (not 1CC

C of inverse pseudoC

B A C define

d

CCd=

+=

=

=

+=

µ

µµpick A that is easy

to invert & pick

any B (i.e., could

( ) #####

###

#

##

0

0

BCCCCd

dC

Cd

dCC

d

dCCC

Cd

dC

d

dCC

d

−=

−=

=+

=

µ

µµ

µµ

µ

59

solve by numerical

integration from

µ = 0 to 1

any B (i.e., could

be hard to invert)

nonlinear filter performance (accuracy wrt

optimal & computational complexity)

DIMENSIONprocess noise

initial uncertainty

of state vector

measurementexploit

smoothness

sparseness

measurement

noise

stability

& mixing

of dynamics

multi-modal

nonlinearityill-conditioning

quality of

proposal density

concentration

of measure

exploit

structure (e.g.

exact filters)

60

variation in initial uncertainty of x

1010

1015

1020

Dim

en

sio

nle

ss E

rro

r

N = 1000, Stable, d = 10, Quadratic

Huge Initial Uncertainty

Large Initial Uncertainty

Medium Initial Uncertainty

Small Initial Uncertainty

, λλλλ = 0.6

25 Monte Carlo Trials

0 5 10 15 20 25 3010

-5

100

105

Time

Dim

en

sio

nle

ss E

rro

r

61

variation in eigenvalues of the plant (λ)

106

108

1010

1012

Dim

en

sio

nle

ss E

rro

r

N = 1000, d = 10, Cubic

λλλλ = 0.1

0.5

1.0

1.1

1.2

25 Monte Carlo Trials

0 5 10 15 20 25 3010

-2

100

102

104

Time

Dim

en

sio

nle

ss E

rro

r

62

variation in dimension of x

106

108

1010

1012

Dim

en

sio

nle

ss E

rro

r

N = 1000, λλλλ = 1.0, Cubic

Dimension = 5

10

15

20

25 Monte Carlo Trials

0 5 10 15 20 25 3010

-2

100

102

104

Time

Dim

en

sio

nle

ss E

rro

r

63

numerical results so far

applications:

• long range wideband radar

tracking TBMs & ICBMs

• angles only tracking (multiple

radars & 1 radar)

key parameters:

• dimension of state vector of

dynamical model (plant)

• initial uncertainty in the state

vectorradars & 1 radar)

• discrimination for wideband

radar (BMD)

• linear systems

• quadratic nonlinearity

• cubic nonlinearity

• cosine nonlinearity

• Euler’s equations (6 DOF)

vector

• stability of the plant

• signal-to-noise ratio of

measurements

• trajectory of objects

• radar-object geometry

64

particle flow filter is many orders of magnitude faster

real time computation (for the same or better

estimation accuracy)

3 or 4 orders of

3 or 4 orders of magnitude faster

per particle

avoids bottleneck in

many orders of

magnitude faster

3 or 4 orders of magnitude

fewer particles

bottleneck in parallel

processing due to resampling

65

many new ideas

direct integration of PDE but enforce

neutral charge & exploit known exact solutions

separation of variables

(exponential family & arbitrary densities)

method of characteristics (take

gradient to get d eqs in d unknowns)

small curvature flow (take gradient to get d eqs. in d unknowns)

variance reduction for Poisson’s eq exploiting

f = Ax+b or other

log speed & unit vector for Coulomb’s law

inverse or generalized inverse of A+B like Feynman’s QED

perturbation theory

Gelman & Mengoptimal flow

Lagrange’s trick assumed densities Lagrange’s trick

Gromov’s h-principle

renormalization group flow

assumed densities (Gauss-arbitrary,

exponential family-arbitrary, Gauss-

exponential family, etc.)

Le Gland more general homotopy than log-

homotopy (e,g, log-log)

harvest ideas from Monge-Kantorovich

optimal trasnport

big/small parameter approximations (far field, narrow band, steady state, large

number of particles, 1/137, etc.)

parametric flow to enforce stability

non-unique solutions of PDE (mixed in x, λ &

components of x)

Galerkin or

homotopy-Galerkin or differential quadrature

Q ≥ 0 with Dirac approximation to solve

Fokker-Planck PDE

Q ≥ 0 with Daum exact solutions of Fokker-

Planck (1986)

HYBRIDS

of above

obtain unique solution to Gauss div law by enforcing Lorentz invariance (like Maxwell’s eqs) 66

small curvature particle flow:

x

f

x

pf

x

p

x

fdiv

fx

pfdiv

d

Kdh

+=

∂−

∂−

∂−=

∂−−=−

:flow) ibleincompress and bAx f (and flow curvature smallfor

loglog)(

x

logh

:PDE above ofgradient thecompute

log)(

)(log)log(

2

2

λ

λ

T

x

f

x

p

x

h

x

pf

∂+

∂−=

=∂

+=

logloglog

:hence

0x

div(f)

:flow) ibleincompress and bAx f (and flow curvature smallfor

1

2

2

67

• extremely fast to compute

• Hessian of logp is

always non-singular

• similar to Fisher matrix

• generalizes our two

favorite flows!

10-1

100

101

102

EKF

PF Incompressible

new filter improves accuracy by

two orders of magnitude

median error

N = 500 particles

extended Kalman filter

standard particle filter

0 2 4 6 8 1010

-3

10-2

10

Time (sec)

PF Incompressible

PF Ax+BN = 500 particles

new filter

key idea: small curvature flow

(inspired by fluid dynamics) to make

solution of PDE div(pf) = η much faster

Euler’s equations:

3121233

2313122

1232311

)(

)(

)(

MIII

MIII

MIII

=−+

=−+

=−+

ωωω

ωωω

ωωω

&

&

&

68

0div(f)

:flow ibleincompress

=)b()xA(f

:flowGaussian

+= λλ

small curvature flow:

69

0div(f) =Tr(A)div(f)

)b()xA(f

=

+= λλ

0)(

=∂

x

fdiv

small curvature flow (even better):

T

x

f

x

p

x

pf

x

fdiv

fx

pfdiv

d

Kdh

+=

∂−

∂−

∂−=

∂−−=−

:flow) ibleincompress and bAx f (and flow curvature smallfor

loglog)(

x

logh

:PDE above ofgradient thecompute

log)(

)(log)log(

2

2

λ

λ

T

x

h

xx

p

x

pf

∂+

∂−=

=∂

+=

logloglog

:hence

0x

div(f)

:flow) ibleincompress and bAx f (and flow curvature smallfor

1

2

2

70

computing the inverse of A + B:

1

1-

1-

0)(

BA)G( :homotopy a define

GG

G of inverse G

B A G define

=

+=

=

=

+=

d

GGd

I

µ

µµ

pick A that is easy

to invert & pick

A & B are linear operators (e.g., matrices or

differential operators)

111

111

1

11

0

0

−−−

−−−

−−

−=

−=

=+

=

BGGd

dG

Gd

dGG

d

dGGG

Gd

dG

d

dGG

d

µ

µµ

µµ

µ

solve by numerical integration from

µ = 0 to 1with the obvious initial condition

to invert & pick any B (i.e., could

be hard to invert)

71

new nonlinear filter: particle flow

new particle flow filter standard particle filters

many orders of magnitude faster than

standard particle filters

suffers from curse of dimensionality

3 to 4 orders of magnitude faster per

particle

suffers from “particle degeneracy”

3 to 4 orders of magnitude fewer particles

required to achieve optimal accuracy

requires millions or billions of particles

for high dimensional problems

Bayes’ rule is computed using particle

flow (like physics)

Bayes’ rule is computed using a pointwise

multiplication of two functions

no proposal density depends on proposal density (e.g.,

Gaussian from EKF or UKF)

no resampling of particles resampling is needed to repair the damage

done by Bayes’ rule

embarrassingly parallelizable suffers from bottleneck due to resampling

computes log of unnormalized density suffers from severe numerical problems

due to computation of normalized density72

History of Mathematics

1. Creation of the integers

2. Invention of counting

3. Invention of addition as a fast

method of countingmethod of counting

4. Invention of multiplication as a

fast method of addition

5. Invention of particle flow as a

fast method of multiplication*

73

Fred Daum, Jim Huang & Arjang Noushin, “exact particle flow for nonlinear filters,” Proceedings of SPIE Conference, Orlando Florida,

April 2010.

Fred Daum & Jim Huang, “particle degeneracy: root cause & solution,” Proceedings of SPIE Conference, Orlando Florida, April 2011.

Fred Daum & Jim Huang “numerical experiments for nonlinear filters with exact particle flow induced by log-homotopy,” Proceedings of SPIE Conference, Orlando Florida, April 2010.Conference, Orlando Florida, April 2010.

Fred Daum & Jim Huang, “exact particle flow for nonlinear filters: seventeen dubious solutions to a linear first order underdetermined PDE,” Proceedings of IEEE Conference, Asilomar California, November 2010.

Fred Daum, Jim Huang & Arjang Noushin, “Coulomb’s law particle flow for nonlinear filters,” Proceedings of SPIE Conference, San Diego California, August 2011.

74

BACKUPBACKUP

75

fluid dynamics

electro-magnetics

plasma physics

quantum field

theorybig bang

76

1. derive PDE

2. solve PDE

77

3. test solution

physics & nonlinear filters

Metropolis algorithm &

particle filters

PDEs

(Fokker-Planck, Gauss div law, Maxwell’s eqs,

continuity equation, Einstein field eqs., etc.)

Beneš advice:

“think like a fluid mechanic”

Coulomb’s law &

particle flow

path integrals for NLF (Pierre Del

Moral & Bhashyam Balaji & Mike Schilder, et al.)

Why is the maximum speed of particles c ?

incompressible flow

irrotational flow

small curvature flow

Dan Zwillinger’s

advice: “think like a physicist”

Lie algebras & gauge theory for NLF

(Brockett & Mitter & Marcus & Ocone et al.)

1/N

Dudley Herschbach

Mark Kac & Ed Witten

BIG BANG

inflation

cosmic acceleration

quantization of physical particles & particles in

filters also!

(via resampling)

renormalization group flow & particle flow

(Bhashyam Balaji)

Hamiltonian Monte Carlo

“the physicists are always 20 years

ahead of us”

Prof. Jun Liu

strings vs. particles

(Francois Le Gland 2001)

78

numerical results so far

applications:

• long range wideband radar

tracking TBMs & ICBMs

• angles only tracking (multiple

radars & 1 radar)

key parameters:

• dimension of state vector of

dynamical model (plant)

• initial uncertainty in the state

vectorradars & 1 radar)

• discrimination for wideband

radar (BMD)

• linear systems

• quadratic nonlinearity

• cubic nonlinearity

• cosine nonlinearity

• Euler’s equations (6 DOF)

vector

• stability of the plant

• signal-to-noise ratio of

measurements

• trajectory of objects

• radar-object geometry

79

104

d = 12, ny = 3, y = x2, SNR = 20dB

Dim

en

sio

nle

ss E

rro

r

quadratic measurement nonlinearity

extended Kalman filter (EKF)

particle filter beats the

102

103

104

105

102

103

Dim

en

sio

nle

ss E

rro

r

Number of Particles

EKF

PF

particle filter

particle filter beats the

EKF by two orders of

magnitude in accuracy

80

106

107

d = 12, ny = 3, y = x

3, SNR = 20dB

Dim

en

sio

nle

ss

Err

or

cubic measurement nonlinearity

particle filter beats the

EKF by two orders of

magnitude in accuracy

extended Kalman filter (EKF)

102

103

104

105

103

104

105

Dim

en

sio

nle

ss

Err

or

Number of Particles

EKF

PF

magnitude in accuracy

particle filter

81

0 5 10 15 20 25 30 35 400

100

200

300N = 1000

An

gle

Err

or

(de

g)

EKF

PF

nonlinear measurement (sine function of θ)

0 5 10 15 20 25 30 35 40

0 5 10 15 20 25 30 35 400

5

10

15

Time

An

gle

Ra

te E

rro

r (d

eg

/se

c)

EKF

PF

82

100

101

102

Euler’s equations of rotational motion

with wideband range measurements

median

error in

state

vector

extended Kalman filter

standard particle filter

new filter

0 2 4 6 8 1010

-3

10-2

10-1

Time (sec)

EKF

PF Incompressible

PF Ax+B

N = 500 particles

50 Monte Carlo runs

new particle filter

new filter

improves

accuracy by two

orders of

magnitude

83

103

104

105

Ve

loc

ity

Err

or

(m/s

ec

)

N = 100, σσσσr = 100m

EKF

Inexact Flow

Exact Flow

Exact Flow with Redraw

radar tracking ballistic missile

(d =6 & N = 100 particles)

incompressible

flow

0 20 40 60 80 10010

0

101

102

Time (sec)

Ve

loc

ity

Err

or

(m/s

ec

)

exact

flow

EKF

84

103

104

km

median range estimation error (angles only data)

0 50 100 150 200 250 300 350 40010

1

102

Time (sec)

km

EKF: solid

Log Homotopy PF: dashed

angle = 90 deg

angle = 75 deg

angle = 25 deg

angle = 10 deg

both EKF & PF

use modified spherical

coordinates

85

103

104

angle = 10 deg

median range estimation error (angles only data)

extended Kalman

filter

0 50 100 150 200 250 300 350 40010

1

102

Time (sec)

km

both EKF & PF

use modified spherical

coordinates

exact flow filter

86

100

101

102

3D position error, angle 1 sigma = 0.1 mrad

Radar separation = 0 m

10 m

100 m

1000 m

two radars air targets angles only

0 20 40 60 80 100 12010

-4

10-3

10-2

10-1

Time (sec)

km

excellent

accuracy

from two

radars

87

100

101

102

3D position error, angle 1 sigma = 0.1 mrad

Radar separation = 1000 m

2000 m

5000 m

10000 mexcellent

accuracy

from two

radars

two radars air targets angles only

0 20 40 60 80 100 12010

-4

10-3

10-2

10-1

Time (sec)

km

88

100

101

102

3D position error, angle 1 sigma = 1.0 mrad

Radar separation = 0 m

10 m

100 m

1000 m

two radars air targets angles only

0 20 40 60 80 100 12010

-3

10-2

10-1

Time (sec)

km

excellent

accuracy

from two

radars

89

100

101

102

Euler’s equations of rotational motion

with wideband range measurements

median

error in

state

vector

extended Kalman filter

standard particle filter

new filter

0 2 4 6 8 1010

-3

10-2

10-1

Time (sec)

EKF

PF Incompressible

PF Ax+B

N = 500 particles

50 Monte Carlo runs

new particle filter

new filter

improves

accuracy by two

orders of

magnitude

90

104

106

108

Dim

en

sio

nle

ss E

rro

r

Stable, d = 30, Linear

EKF

Inexact Flow

Exact Flow

Exact Flow with Redraw

only need N = 100

particles for optimal

accuracy for d = 30

102

103

104

100

102

Number of Particles

Dim

en

sio

nle

ss E

rro

r

accuracy for d = 30

dimensional problem

91

exact flow: performance vs. number of particles

101

102

Dim

en

sio

nle

ss E

rro

r A

fter

30 U

pd

ate

s

λλλλ = 1.2, Linear, Large Initial Uncertainty

Dimension = 5

10

15

20

30extremely

unstable

plant

25 Monte Carlo Trials

102

103

104

10-1

100

Number of Particles

Dim

en

sio

nle

ss E

rro

r A

fter

30 U

pd

ate

s

92

93

item exact particle flow Monge-Kantorovich

optimal transport

1. purpose fix particle degeneracy due

to Bayes’ rule

move physical objects with

minimal effort from p1 to p2

2. conservation of

probability mass along flow

yes yes

3. deterministic yes* yes

4. homotopy of density no yes

5. log-homotopy of

density

yes no

6. optimality criteria none minimum action, etc.6. optimality criteria none minimum action, etc.

7. how to pick a solution a dozen distinct methods minimum action, etc.

8. stability of flow explicitly

considered

yes rarely

9. high dimensional

applications solved

successfully

yes (d ≤ 30) no (d = 1, 2 or 3)

10. computational

complexity

numerical integration of

ODE for each particle

solution of Monge-Ampere

nonlinear PDE or other PDE

11. solution of PDE for nice

special cases

many (incompressible,

irrotational, Gaussian, etc.)

only irrotational94

fast Ewald’s method vs. Coulomb’s law

item fast Ewald method

in physics &

chemistry*

Coulomb’s law

with fast k-NN

comments

1. dimension of x d = 3 d = 3 to 30 rapid decay of

Coulomb kernel in

higher d helps!

2. relative error

desired

0.0001 or better 1% to 10% all Ewald methods the

same for 1% accuracy

3. cut-off in x space fixed distance random per k-NN automatic space-taper

to weight convolution

4. desired force on mesh at particles big difference!

5. neutral charge locally enforced locally enforced crucial

6. smoothing charge Gaussian no explicit smoothing

7. k-space or real-

space

both real space (x) no FFT needed for

Coulomb

95

*Shan, Klepeis, Eastwood, Dror & Shaw, “Gaussian split Ewald: a fast Ewald mesh

method for molecular simulation,” Journal of Chemical Physics, 2005.

new exact particle flow: operator-valued homotopy

Improved version of incompressible flow (does not assume that div(f) = 0 and makes no other assumptions about the problem)

Avoids computing p(x,λ) or dividing by p (unlike Coulomb’s law), but rather it uses the gradient of the log of p(x,λ)

Does not depend on EKF or UKF

Completely general (highly non-Gaussian multimodal densities and compressible flow)

Compute the flow dx/dλ using the generalized inverse of the sum of two linear differential operators (A + B) using an ODE derived with a homotopy (similar to matrix inversion using ODE derived by homotopy); exploits easy computation of generalized inverse of A; A = gradient of log p(x,λ) as in incompressible flow & B = divergence

Inspired by: (1) Feynman’s perturbation approximation for QED but we do not approximate by exploiting small parameter 1/137 (see Kaku‘s book on QFT page 137) and (2) homotopy for matrix inverse and (3) generalized inverse of sum of two matrices in certain special cases (see page 50 in Campbell & Meyer book), as well as (4) Gromov’s h-principle; but our algorithm is new & distinct from these methods.

96

105

106

107

d = 12, ny = 3, y = x

3, SNR = 20dB

Dim

en

sio

nle

ss

Err

or cubic measurement

nonlinearity

102

103

104

105

103

104

Dim

en

sio

nle

ss

Err

or

Number of Particles

EKF

PF

97

finite dimensional parametric approximation:

2

2

)()(x

logplog(h)J

)b()xA()f(x,let

)(log

)log(

rATrbAx

rfdivfx

phJ

+++∂

∂+=

+≈

++∂

∂+=

λλλ

98

3

6

j

T

donly toreduced becan thisA sparse

forbut ,d is complexity nalcomputatio however, )5(

)(Tr(A) div(f) that note (4)

stablerobustly flow make penalty to add also could (3)

flow ofstability force to-BBAlet could (2)

xparticleeach at J minimize tob & Afor solve )1(

x

A∑==

=

λ

:flow)exact our for usual (as EKFan usingGaussian

h& gith solution wexact our from computed are b &A in which

)c(x,)b()xA()f(x,let ,generality of lossWithout

)(log)(x

)logp(x,

:PDE following thesatisfies that )f(x, flow a find want toWe

xhfdivf

++=

−=+∂

λλλλ

λ

λ

hybrid method:

2#

#

#

log/

loglog

:(.) of inverse-pseudo thedenotes (.)in which

)log()()(log

x

logpc

:)c(x, norm minimum thecompute and 0,div(c) that Assume

)log()()()(x

logp

x

p

x

p

x

p

hATrbAxx

p

hcdivATrcbAx

T

∂=

+++

∂−≈

−=++++∂

λ

99

recall derivation of incompressible flow:

∂−−=−

=

λ

λ

λλ

fx

pfdiv

d

Kdh

xfd

dx

log)(

)(log)log(

),(

∂−=

=

λ

λ

λ

d

Kdh

x

pf

xd

)(loglog

log

:inverse-pseudo theusing ffor solve and

0,div(f) ible,incompress is flow that theassume

#

100

recall incompressible particle flow:

2log

log)(log)(log

d

dx

−−

= λ

λ

λ px

p

d

Kdxh

T

gradient zerofor 0

2logd

=

∂=

λ

λ

d

dx

x

p

101

new derivation of compressible flow:

∂−−=−

=

λ

λ

λλ

fx

pfdiv

d

Kdh

xfd

dx

log)(

)(log)log(

),(

+

∂−=

λ

λ

λ

d

Kdhdiv

x

pf

xd

)(loglog

log

:inverse-pseudo theusing ffor solve and

ible,incompress is flow that theassumenot willwe

#

102

particle flow filter

• orders of magnitude faster than standard particle filters

• orders of magnitude more accurate than the extended

Kalman filter for difficult nonlinear problems

• solves particle degeneracy problem using particle flow

induced by log-homotopy for Bayes’ rule

• no resampling of particles• no resampling of particles

• no proposal density

• no importance sampling & no MCMC methods

• unnormalized log probability density

• embarrassingly parallelizable w/o resampling bottleneck

(unlike other particle filters)

• exploits smoothness & regularity of densities

103

effect of divergence of f:

log)()log(

),(

#

fx

pfdivh

xfd

dx

∂−−=

= λλ

suppose

that div(f) is

given

[ ])(loglog

#

fdivhx

pf +

∂−=

effect of div(f)

is to change

the speed of

particle flow

if div(f) = 0

we get

incompressible

flow104

incompressible particle flow

2

log)log(

dx

∂−

= x

ph

T

gradient zerofor 0

2logd

=

∂ ∂=

λ

λ

d

dx

x

px

105

103

104

km

angles only tracking

d = 6 and N = 500 particles

exact flow filter

0 50 100 150 200 250 300 350 40010

1

102

Time (sec)

EKF: Solid

Log Homotopy PF: Dashed

Tincknell Angle = 90 deg

Tincknell Angle = 75 deg

Tincknell Angle = 25 deg

Tincknell Angle = 10 deg

median range error

(100 Monte Carlo runs)

106

103

104

km

Tincknell angle = 90 deg

angles only tracking

d = 6 and N = 500 particles

exact flow filter

0 50 100 150 200 250 300 350 40010

1

102

Time (sec)

km

EKF

Log Homotopy PF

median range error

(100 Monte Carlo runs)

107

103

104

km

Tincknell angle = 75 deg

angles only tracking

d = 6 and N = 500 particles

exact flow filter

0 50 100 150 200 250 300 350 40010

1

102

Time (sec)

km

EKF

Log Homotopy PF

median range error

(100 Monte Carlo runs)

108

103

104

km

Tincknell angle = 25 deg

angles only tracking

d = 6 and N = 500 particles

exact flow filter

0 50 100 150 200 250 300 350 40010

1

102

Time (sec)

km

EKF

Log Homotopy PF

median range error

(100 Monte Carlo runs)

109

103

104

km

Tincknell angle = 10 deg

angles only tracking

d = 6 and N = 500 particles

exact flow filter

0 50 100 150 200 250 300 350 40010

1

102

Time (sec)

km

EKF

Log Homotopy PF

median range error

(100 Monte Carlo runs)

110

exact particle flow for Gaussian densities:

fx

pfdivh

xfd

dx

∂−−=

=

:exactly ffor solvecan weGaussian,h & gfor

log)()log(

),( λλ

[ ]

( ) ( )[ ]xAzRPHAIAIb

HRHPHPHA

bAxf

T

TT

+++=

+−=

+=

1

1

2

2

1

:exactly ffor solvecan weGaussian,h & gfor

λλ

λ

111

automatically stable

under very mild

conditions &

extremely fast

0

0.2

0.4

0.6

0.8Inside = 5 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

112

0

0.2

0.4

0.6

0.8Inside = 10.6 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

113

0

0.2

0.4

0.6

0.8Inside = 13.8 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

114

0

0.2

0.4

0.6

0.8Inside = 16.6 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

115

0

0.2

0.4

0.6

0.8Inside = 17.6 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

116

0

0.2

0.4

0.6

0.8Inside = 21 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

117

0

0.2

0.4

0.6

0.8Inside = 21 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

118

0

0.2

0.4

0.6

0.8Inside = 23.8 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

119

0

0.2

0.4

0.6

0.8Inside = 26.4 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

120

0

0.2

0.4

0.6

0.8Inside = 27.6 percent, Magenta: truth, Green: PF estimate, Black: KF

Ax+

b

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Ax+

b

121

incompressible particle flow

2

log)log(

dx

∂−

= x

ph

T

gradient zerofor 0

2logd

=

∂ ∂=

λ

λ

d

dx

x

px

122

0

0.2

0.4

0.6

0.8Inside = 6 percent, Magenta: truth, Green: PF estimate, Black: KF

Hessia

n

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Hessia

n

123

0

0.2

0.4

0.6

0.8Inside = 5.8 percent, Magenta: truth, Green: PF estimate, Black: KF

Hessia

n

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Hessia

n

124

0

0.2

0.4

0.6

0.8Inside = 8.2 percent, Magenta: truth, Green: PF estimate, Black: KF

Hessia

n

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Hessia

n

125

0

0.2

0.4

0.6

0.8Inside = 9.2 percent, Magenta: truth, Green: PF estimate, Black: KF

Hessia

n

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Hessia

n

126

0

0.2

0.4

0.6

0.8Inside = 11.2 percent, Magenta: truth, Green: PF estimate, Black: KF

Hessia

n

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Hessia

n

127

0

0.2

0.4

0.6

0.8Inside = 11.8 percent, Magenta: truth, Green: PF estimate, Black: KF

Hessia

n

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Hessia

n

128

0

0.2

0.4

0.6

0.8Inside = 12.8 percent, Magenta: truth, Green: PF estimate, Black: KF

Hessia

n

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Hessia

n

129

0

0.2

0.4

0.6

0.8Inside = 12 percent, Magenta: truth, Green: PF estimate, Black: KF

Hessia

n

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Hessia

n

130

0

0.2

0.4

0.6

0.8Inside = 11.6 percent, Magenta: truth, Green: PF estimate, Black: KF

Hessia

n

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

-0.6

-0.4

-0.2

Hessia

n

131

variation in SNR

101

102

Dim

en

sio

nle

ss E

rro

r

N = 1000, Stable, d = 10, Linear

SNR = 10 dB

15 dB

20 dB

30 dB

, λλλλ = 0.6

25 Monte Carlo Trials

0 5 10 15 20 25 3010

-1

100

Time

Dim

en

sio

nle

ss E

rro

r

132

variation in process noise

103

104

105

106

107

Dim

en

sio

nle

ss E

rro

r

N = 1000, Stable, d = 10, Quadratic

σσσσq = 0.01X

0.1X

1X

10X

100X

, λλλλ = 0.6

25 Monte Carlo Trials

0 5 10 15 20 25 3010

-1

100

101

102

Time

Dim

en

sio

nle

ss E

rro

r

133

particle flow filter

• orders of magnitude faster than standard particle filters

• orders of magnitude more accurate than the extended

Kalman filter for difficult nonlinear problems

• solves particle degeneracy problem using particle flow

induced by log-homotopy for Bayes’ rule

• no resampling of particles• no resampling of particles

• no proposal density

• no importance sampling & no MCMC methods

• unnormalized log probability density

• embarrassingly parallelizable w/o resampling bottleneck

(unlike other particle filters)

• exploits smoothness & regularity of densities

134

fundamental PDE for exact particle flow:

x

pfTrxp

xp

x

pfTr

xp

xfd

dx

∂−=

∂−=

=

)(),(

),(log

)(),(

),(

λλ

λ

λ

λ

λλ

Fokker-Planck

equation with Q = 0

assume

log-homotopy

fx

pfdivh

fx

p

x

fTrxpxpxh

xhxgxp

xTrxp

∂−−=

∂−

∂−=

+=

∂−=

log)()log(

),(),()(log

)(log)(log),(log

),(

λλ

λλ

λλ

log-homotopy

first order linear

underdetermined

PDE in f(x,λ)135

direct integration of fundamental PDE:

...

),(),(

)),((

:PDE theof form divergence theuse

2

2

1

1

∂−=

∂++

∂+

∂=

=

∂=

∑≠

η

η

ληλ

λ

dkj

d

d

x

q

x

q

x

q

x

q

x

q

xx

xqTrxqdiv

pick for best

stability of

particle flow

[ ]

.0(x)dx iff

exists )q(x,for solution a , and on conditions regularity Assuming

)conditionsity compatibilfor (except function arbitrary )(

)()()(

=

Ω

=∂

∂=

−=

∂∂

Ω

η

λη

θ

θη

d

jk k

kj

j

x

jj

jk kj

x

qx

dxxxxq

xx

j

particle flow

136

more details of direct integration:

∫ ∫Ω

=

−=

=

such thatfunction arbitrary )(x

in which

2kfor )()()(

)()(

ηρη

η

k

x

kkk dxdxxxxq

xqdiv

k

k

Ω

Ω

=

Ω

=

0(x)dx that is q(x)solution a of

existence for thecondition sufficient &necessary a

set,smooth connected open, bounded, is and

support,compact with functionssmooth assuming

1)(xk

k

η

ρ kdx

k

137

Oh’s Formula for Monte Carlo errors

Nkk

kd

/21

exp21

1 22

+

+

+≈

εσ

Assumptions:

(1) Gaussian density (zero mean & unit covariance matrix)

(2) d-dimensional random variable

(3) Proposal density is also Gaussian with mean ε and covariance matrix kI, but it is not exact for k ≠ 1 or ε ≠ 0

(4) N = number of Monte Carlo trials

138

nonlinear filtering problem

x = d-dimensional state vector

t = time

w(t) = process noise vector

dx = F(x, t)dt + G(x, t) dwcontinuous

time

dynamicsdiscrete

time

measurements

z(tk) = m-dimensional measurement vector

tk = time of kth measurement

vk = measurement noise vector

p(x, t | Zk) = probability density of x at time t given Zk

Zk = set of all measurements up to & including time tk

z(tk) = H(x(tk), tk, vk)

measurements

139

difficulties for exact finite dimensional filters

vs. particle filters

Bayes’ update of

conditional density

of x

prediction of

conditional density

of x with time

1. exact filters

(e.g., Daum 1986)

easy hard

(e.g., Daum 1986)

2. particle filters hard easy

3. hybrid of exact

& particle filters

? ?

140

What is a particle filter?

141

Prediction of

conditional

probability

density from

tk-1 to tk

Update conditional

probability density

particles particles

Particle Filter

Probability

density is

represented

by particles

measurements

Importance sampling

from proposal density

(Monte Carlo or QMC

sampling)

probability density

using current

measurement

& Bayes’ rule

particles

Monte

Carlo

or QMC

simulation

of dynamics

142

103

104

105

Velo

cit

y E

rro

r (m

/sec)N = 100, σσσσ

r = 100m

EKF

Inexact Flow

Exact Flow

Exact Flow with Redraw

radar tracking ballistic missile

(d =6 & N = 100 particles)

incompressible

flow

0 20 40 60 80 10010

0

101

102

10

Time (sec)

Velo

cit

y E

rro

r (m

/sec)

143

exact

flow

EKF

method to solve PDE how to pick unique solution computation

1. generalized inverse of linear differential

operator

minimum norm* Coulomb’s law or fast Poisson solver

2. Poisson’s equation gradient of potential*

(assume irrotational flow)

Coulomb’s law or fast Poisson solver

3. generalized inverse of gradient of log-

homotopy

assume incompressible flow (i.e.,

divergence free flow)

fast (but need to compute the gradient

of logp(x, λ) from random points)

4. most general solution most robustly stable filter or random

pick, etc.

fast (but need to compute the gradient

of logp(x, λ) from random points)

5. separation of variables (Gaussian) pick solution of specific form

(polynomial)

extremely fast (formula for flow)

6. separation of variables

(exponential family)

pick solution of specific form (finite basis

functions)

very fast (formula for flow)

7. variational formulation (Gauss & Hertz) convex function minimization ODEs

8. optimal control formulation convex functional minimization (e.g.,

least action like Monge-Kantorovich)

HJB PDE or Euler-Lagrange PDEs

(or maybe ODES for nice problem?)

9. direct integration (of first order linear PDE in

divergence form)

choice of d-1 arbitrary functions one-dimensional integral

10. generalized method of characteristics more conditions (e.g., curl = given &

chain rule)

ODEs from chain rule

11. another homotopy (inspired by Gromov’s

h-principle) like Feynman’s QED perturbation

initial condition of ODE &

uniqueness of sol. to ODE

ODEs from homotopy

12. finite dimensional parametric flow

(e.g., f = Ax+b with A & b parameters)

non-singular matrix to invert d³ or d^6 (least squares for d or d² parameters, i.e., A & b)

13. Fourier transform of PDE (divergence form

of linear PDE has constant coefficients!)

minimum norm* or most stable flow Coulomb’s law or fast Poisson solver144

(1) Flavia Lanzara , Vladimir Maz’ya & Gunther Schmidt

“On the fast computation of high dimensional volume

potentials” arXiv:0911.0443v1 [math.NA] 2 Nov 2009

note: linear computational complexity in d for uniform grid,

and it can be extended to scattered data!

145

and it can be extended to scattered data!

(2) huge literature on fast Poisson solvers (e.g., FMM

Rokhlin, Beylkin, Coifman, Hackbush, et al.)

top related