particle flow for nonlinear filters

particle flow for

nonlinear filters

Fred Daum

19 June 2012

Customer Success Is Our Mission is a trademark of Raytheon Company.

discrete time nonlinear filter problem

wttxFtx

tat timet vector measuremen

tat time vector noise process

tat time vectorstate)(

),),(()(

=+estimate x

given noisy

measurements

vttxHz

,...,,Z

tsmeasuremen all ofset

given Z tat time x ofdensity y probabilit),(

tat time vector noiset measuremen

),),((

many applications of filtering & prediction

roboticsnavigation communicationscontrol of chemical,

mechanical, electrical & nuclear plants

guidance

tracking missiles, satellites, aircraft, ground vehicles,

artillery shells, comets, people, cows, salmon

weather & climate prediction

predicting ionosphere, thermosphere, troposphere

fancy signal processing

science imagingmedicine (e.g., MRI, surgery, pace makers,

drug design, diagnosis)

fancy signal processing

(e.g., MIMO radar, MIMO sonar, MIMO comm, MIMO nav)

oil & mineral exploration

financial engineering adaptive antennasaudio & video signal

processing

Bayesian decisions multi-sensor data fusion compressive sensing other….

type of

nonlinear

filter

statistics computed computational

complexity

estimation

accuracy

representation

of probability

density

extended

Kalman filters

mean vector &

covariance matrix d³

sometimes good

but often highly

suboptimal

mean vector &

covariance matrix

unscented

Kalman filters

mean vector &

sometimes better

than EKF but

sometimes worse

mean vector &

covariance matrix

batch least

squares

mean vector &

sometimes better

than EKF but

sometimes worse

mean vector &

covariance matrixd³sometimes worse

numerical

solution of

Fokker-Planck

full conditional

probability density of

statecurse of

dimensionality

optimal* points in state space

and/or smooth

functions

particle filters

full conditional

curse of

dimensionality

optimal* particles

exact recursive

filters (Kalman,

Beneš, Daum,

Wonham, Yau)

full conditional

polynomial in d

(for special

problems)

optimal

(for special

problems)

sufficient

statistics

dimension of

the state vector

extended

Kalman

filters

particle flow

filters

nonlinearity or non-Gaussianity

standard particle filters

filters

exact flow filter is many orders of magnitude faster per

particle than standard particle filters

- - - - -

bootstrap

EKF proposal

incomp flow

exact flow

r 30 U

d = 30

d = 20

d = 10

d = 5 bootstrap

particle filter

EKF proposal

* Intel Corel 2 CPU, 1.86GHz, 0.98GB of RAM, PC-MATLAB version 7.7

25 Monte Carlo trials10

Number of Particles

r 30 U

exact flow

EKF proposal

incompressible

particle flow filter is many orders of magnitude faster

real time computation (for the same or better

estimation accuracy)

3 or 4 orders of

3 or 4 orders of magnitude faster

per particle

avoids bottleneck in

many orders of

magnitude faster

3 or 4 orders of magnitude

fewer particles

bottleneck in parallel

processing due to resampling

PF Incompressible

new filter improves accuracy by

two orders of magnitude

median error

N = 500 particles

extended Kalman filter

standard particle filter

0 2 4 6 8 1010

Time (sec)

PF Incompressible

PF Ax+BN = 500 particles

new filter

key idea: small curvature flow

(inspired by fluid dynamics) to make

solution of PDE div(pf) = η much faster

Euler’s equations:

3121233

2313122

1232311

ωωω

Why engineers like particle filters:

• Very easy to code

• Extremely general dynamics & measurements: nonlinear & non-Gaussian

• Optimal estimation accuracy (if you use enough • Optimal estimation accuracy (if you use enough particles….)

• You don’t need to know anything about stochastic differential equations or any fancy numerical methods for solving PDEs

• Some people (erroneously) think that PFs beat the curse of dimensionality

curse of dimensionality for

classic particle filter

optimal

accuracy:

r = 1.0

prediction of

conditional

probability

density from

tk-1 to tk

nonlinear filter

measurements

:rule Bayes'

solution of

Fokker-Planck

equation

),(),(),(

:rule Bayes'

1 kkkkkk txzpZtxpZtxp −=

particle degeneracy*

likelihood of

measurementprior

density

particles to represent the prior

*Daum & Huang, “Particle degeneracy: root cause & solution,” SPIE Proceedings 2011.

chicken & egg problem

How do you pick a

good way to represent

the product of two

functions before you functions before you

compute the product

itself?

induced flow of particles

for Bayes’ rule

pdf pdfflow of density

prior posterior

)(log)(log),(log xhxgxp λλ +=

particles particles

flow of particles

sample from

density

sample from

density

λ=0 λ=1

),( λλ

derivation of PDE for exact particle flow:

∂−=

pfTrxp

),(log

),( Fokker-Planck

equation with zero

diffusion

definition

of p(x,λ)

−−=

∂−

∂∂

λλη

λλλ

Kdxhxp

pfdivxpK

Kxhxgxp

)(log)(log),(

)(),()(log

)(log)(log)(log),(log

definition

PDE for

f given p

linear first order highly underdetermined PDE:

Kdxhxpx

xxqdiv

−−=

)(log)(log),(),(

),()),((

λλλη

ληλ

like Gauss’ divergence

law in electromagneticsdxxx ∂∂∂ 21

function values are

only known at

random points in d-

dimensional space

q = pf

f = unknown function

p & η = known at random points

We want dx/dλ = f(x,λ)

to be a stable dynamical

system.

law in electromagnetics

likelihood of

measurementprior

densityoptimal

accuracyg h

root cause of

curse of dimensionality:

particles to represent the prior

pdf pdf

particles particles

flow of density

flow of particles

sample from

density

sample from

density

λ=0 λ=1

prior posterior

),( λλ

f.for PDE above thesolving

by flow particle design the We

loglog)(

Kdhppfdiv

method to solve PDE how to pick unique solution comments

1. generalized inverse of linear differential operator minimum norm* very difficult to design robust stable & fast algorithm

2. Poisson’s equation gradient of potential*(assume irrotational flow)

very difficult to design robust stable & fast algorithm

3. generalized inverse of gradient of log-homotopy assume incompressible flow & pick minimum L² norm solution

workhorse for multimodal densities

4. generalization of method #3 most robustly stable filter or random pick, etc.

workhorse for multimodal densities

5. separation of variables (Gaussian) pick solution of specific form (polynomial) extremely fast & hard to beat in accuracy for many problems

6. separation of variables

(exponential family)

pick solution of specific form (finite basis functions)

needs theoretical work & numerical experiments

7. variational formulation (Gauss & Hertz) convex function minimization needs work

8. optimal control formulation convex functional minimization (e.g., least action like Monge-Kantorovich)

very high computational complexityaction like Monge-Kantorovich)

9. direct integration (of first order linear PDE in divergence form)

choice of d-1 arbitrary functions should work with enforcement of neutral charge density & importance sampling

10. generalized method of characteristics more conditions (e.g., small curvature or specify curl, or use Lorentz invariance)

11. another homotopy (inspired by Gromov’s

h-principle) like Feynman’s QED perturbation

initial condition of ODE &

uniqueness of sol. to ODE

12. finite dimensional parametric flow

(e.g., f = Ax+b with A & b parameters)

non-singular matrix to invert needs numerical experiments

13. Fourier transform of PDE (divergence form of linear PDE has constant coefficients!)

minimum norm* or most stable flow very difficult to design robust stable & fast algorithm

14. small curvature flow & assumed prior density solve d x d system of linear equations new in 2012. Beats other methods for difficult nonlinear problems.

15. small curvature flow & homotopy for inverse of A + B (sum of two linear operators)

numerically integrate ODE new in 2012. extremely cool theory.

16. small curvature flow & homotopy for generalized inverse of A + B (sum of two linear operators)

numerically integrate ODE new in 2012

exact particle flow for Gaussian densities:

∂−−=−

:exactly ffor solvecan weGaussian,h & gfor

)(log)log(

( ) ( )[ ]xAzRPHAIAIb

HRHPHPHA

automatically stable

under very mild

conditions &

extremely fast

incompressible particle flow

gradient zero-nonfor

),(log

))(log(dx

∂−

otherwise 0

gradient zero-nonfor ),(log

))(log(

irrotational particle flow:

∫ −

≥−

)logK(

-logh(y))p(y,)V(x,

3 dfor ),(),(

),(),(

),(/),(

λλλ

ληλ

λλ Poisson’s

equation

−−

∂−≈

−−

∂−=

−−

∂−=

− ∂

xxdcKxh

yxdcKyhE

yxdcKyhyp

))(2()

)(log)((log

))(2()

)(log)((log

))(2()(log)(log),(

-logh(y))p(y,)V(x,2

λλλ

Coulomb’s

0 5 10 15 20 25 30 35 400

300N = 1000

0 5 10 15 20 25 30 35 400

400Time = 1, Frame 1

initial probability distribution of particles:

λ = 0.0

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

λ = 0.1

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

λ = 0.2

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

λ = 0.3

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

λ = 0.4

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

λ = 0.5

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

λ = 0.6

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

λ = 0.7

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

λ = 0.8

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

λ = 0.9

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

final probability distribution of particles (resulting from

one noisy measurement of sin(θ) with Bayes’ rule):

λ = 1

-200 -150 -100 -50 0 50 100 150 200-300

Angle (deg)

induced flow of particles

for Bayes’ rule

pdf pdfflow of density

prior posterior

particles particles

flow of particles

sample from

density

sample from

density

λ=0 λ=1

),( λλ

d = 12, ny = 3, y = x2, SNR = 20dB

extended Kalman filter (EKF)

particle filter beats the

EKF by two orders of

magnitude in accuracy

Number of Particles

quadratic measurement

nonlinearity 35

particle filter

5 Time = 1, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.5 0 0.5 1 1.5

d = 12, ny = 3, y = x

3, SNR = 20dB

Number of Particles

particle filter

cubic measurement

nonlinearity

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

1.5x 10

-1 -0.5 0 0.5 1 1.5 2

direct integration of PDE

xxqdiv

),()),((

:PDE theof form divergence theuse

ληλ

∫ −+=

∂∂∂

),(~),(q~q

q~ q ofcomponent onebut allpick

~-)q~-div(q

~)q~div( problem related ofsolution exact ~

ληλη

most general solution for exact flow:

)(log-Cf

:issolution generalmost the

log)()log(

## −+=

∂−−=

pfdivh

fundamental

PDE for exact

particle flow

vectorldimensiona-darbitrary y

C of inverse dgeneralize

:operator aldifferentilinear a is Cin which

)(log-Cf

could pick y to

robustly stabilize the

filter or random or

computing the generalized inverse of A + B:

BA)C( :homotopy a define

case) in this isit but truegenerally (not 1CC

C of inverse pseudoC

B A C define

µµpick A that is easy

to invert & pick

any B (i.e., could

( ) #####

BCCCCd

solve by numerical

integration from

µ = 0 to 1

any B (i.e., could

be hard to invert)

nonlinear filter performance (accuracy wrt

optimal & computational complexity)

DIMENSIONprocess noise

initial uncertainty

of state vector

measurementexploit

smoothness

sparseness

measurement

stability

& mixing

of dynamics

multi-modal

nonlinearityill-conditioning

quality of

proposal density

concentration

of measure

exploit

structure (e.g.

exact filters)

variation in initial uncertainty of x

N = 1000, Stable, d = 10, Quadratic

Huge Initial Uncertainty

Large Initial Uncertainty

Medium Initial Uncertainty

Small Initial Uncertainty

, λλλλ = 0.6

25 Monte Carlo Trials

0 5 10 15 20 25 3010

variation in eigenvalues of the plant (λ)

N = 1000, d = 10, Cubic

λλλλ = 0.1

0 5 10 15 20 25 3010

variation in dimension of x

N = 1000, λλλλ = 1.0, Cubic

Dimension = 5

0 5 10 15 20 25 3010

numerical results so far

applications:

• long range wideband radar

tracking TBMs & ICBMs

• angles only tracking (multiple

radars & 1 radar)

key parameters:

• dimension of state vector of

dynamical model (plant)

• initial uncertainty in the state

vectorradars & 1 radar)

• discrimination for wideband

radar (BMD)

• linear systems

• quadratic nonlinearity

• cubic nonlinearity

• cosine nonlinearity

• Euler’s equations (6 DOF)

vector

• stability of the plant

• signal-to-noise ratio of

measurements

• trajectory of objects

• radar-object geometry

particle flow filter is many orders of magnitude faster

real time computation (for the same or better

estimation accuracy)

3 or 4 orders of

3 or 4 orders of magnitude faster

per particle

avoids bottleneck in

many orders of

magnitude faster

3 or 4 orders of magnitude

fewer particles

bottleneck in parallel

processing due to resampling

many new ideas

direct integration of PDE but enforce

neutral charge & exploit known exact solutions

separation of variables

(exponential family & arbitrary densities)

method of characteristics (take

gradient to get d eqs in d unknowns)

small curvature flow (take gradient to get d eqs. in d unknowns)

variance reduction for Poisson’s eq exploiting

f = Ax+b or other

log speed & unit vector for Coulomb’s law

inverse or generalized inverse of A+B like Feynman’s QED

perturbation theory

Gelman & Mengoptimal flow

Lagrange’s trick assumed densities Lagrange’s trick

Gromov’s h-principle

renormalization group flow

assumed densities (Gauss-arbitrary,

exponential family-arbitrary, Gauss-

exponential family, etc.)

Le Gland more general homotopy than log-

homotopy (e,g, log-log)

harvest ideas from Monge-Kantorovich

optimal trasnport

big/small parameter approximations (far field, narrow band, steady state, large

number of particles, 1/137, etc.)

parametric flow to enforce stability

non-unique solutions of PDE (mixed in x, λ &

components of x)

Galerkin or

homotopy-Galerkin or differential quadrature

Q ≥ 0 with Dirac approximation to solve

Fokker-Planck PDE

Q ≥ 0 with Daum exact solutions of Fokker-

Planck (1986)

HYBRIDS

of above

obtain unique solution to Gauss div law by enforcing Lorentz invariance (like Maxwell’s eqs) 66

small curvature particle flow:

∂−

∂−=

∂−−=−

:flow) ibleincompress and bAx f (and flow curvature smallfor

loglog)(

:PDE above ofgradient thecompute

)(log)log(

∂−=

logloglog

:hence

div(f)

• extremely fast to compute

• Hessian of logp is

always non-singular

• similar to Fisher matrix

• generalizes our two

favorite flows!

PF Incompressible

new filter improves accuracy by

two orders of magnitude

median error

N = 500 particles

0 2 4 6 8 1010

Time (sec)

PF Incompressible

PF Ax+BN = 500 particles

new filter

key idea: small curvature flow

(inspired by fluid dynamics) to make

solution of PDE div(pf) = η much faster

Euler’s equations:

3121233

2313122

1232311

ωωω

0div(f)

:flow ibleincompress

=)b()xA(f

:flowGaussian

+= λλ

small curvature flow:

0div(f) =Tr(A)div(f)

)b()xA(f

+= λλ

small curvature flow (even better):

∂−

∂−=

∂−−=−

loglog)(

:PDE above ofgradient thecompute

)(log)log(

∂−=

logloglog

:hence

div(f)

computing the inverse of A + B:

BA)G( :homotopy a define

G of inverse G

B A G define

pick A that is easy

to invert & pick

A & B are linear operators (e.g., matrices or

differential operators)

−−−

−−

solve by numerical integration from

µ = 0 to 1with the obvious initial condition

to invert & pick any B (i.e., could

be hard to invert)

new nonlinear filter: particle flow

new particle flow filter standard particle filters

many orders of magnitude faster than

standard particle filters

suffers from curse of dimensionality

3 to 4 orders of magnitude faster per

particle

suffers from “particle degeneracy”

3 to 4 orders of magnitude fewer particles

required to achieve optimal accuracy

requires millions or billions of particles

for high dimensional problems

Bayes’ rule is computed using particle

flow (like physics)

Bayes’ rule is computed using a pointwise

multiplication of two functions

no proposal density depends on proposal density (e.g.,

Gaussian from EKF or UKF)

no resampling of particles resampling is needed to repair the damage

done by Bayes’ rule

embarrassingly parallelizable suffers from bottleneck due to resampling

computes log of unnormalized density suffers from severe numerical problems

due to computation of normalized density72

History of Mathematics

1. Creation of the integers

2. Invention of counting

3. Invention of addition as a fast

method of countingmethod of counting

4. Invention of multiplication as a

fast method of addition

5. Invention of particle flow as a

fast method of multiplication*

Fred Daum, Jim Huang & Arjang Noushin, “exact particle flow for nonlinear filters,” Proceedings of SPIE Conference, Orlando Florida,

April 2010.

Fred Daum & Jim Huang, “particle degeneracy: root cause & solution,” Proceedings of SPIE Conference, Orlando Florida, April 2011.

Fred Daum & Jim Huang “numerical experiments for nonlinear filters with exact particle flow induced by log-homotopy,” Proceedings of SPIE Conference, Orlando Florida, April 2010.Conference, Orlando Florida, April 2010.

Fred Daum & Jim Huang, “exact particle flow for nonlinear filters: seventeen dubious solutions to a linear first order underdetermined PDE,” Proceedings of IEEE Conference, Asilomar California, November 2010.

Fred Daum, Jim Huang & Arjang Noushin, “Coulomb’s law particle flow for nonlinear filters,” Proceedings of SPIE Conference, San Diego California, August 2011.

BACKUPBACKUP

fluid dynamics

electro-magnetics

plasma physics

quantum field

theorybig bang

1. derive PDE

2. solve PDE

3. test solution

physics & nonlinear filters

Metropolis algorithm &

particle filters

(Fokker-Planck, Gauss div law, Maxwell’s eqs,

continuity equation, Einstein field eqs., etc.)

Beneš advice:

“think like a fluid mechanic”

Coulomb’s law &

particle flow

path integrals for NLF (Pierre Del

Moral & Bhashyam Balaji & Mike Schilder, et al.)

Why is the maximum speed of particles c ?

incompressible flow

irrotational flow

small curvature flow

Dan Zwillinger’s

advice: “think like a physicist”

Lie algebras & gauge theory for NLF

(Brockett & Mitter & Marcus & Ocone et al.)

Dudley Herschbach

Mark Kac & Ed Witten

BIG BANG

inflation

cosmic acceleration

quantization of physical particles & particles in

filters also!

(via resampling)

renormalization group flow & particle flow

(Bhashyam Balaji)

Hamiltonian Monte Carlo

“the physicists are always 20 years

ahead of us”

Prof. Jun Liu

strings vs. particles

(Francois Le Gland 2001)

numerical results so far

applications:

• long range wideband radar

tracking TBMs & ICBMs

• angles only tracking (multiple

radars & 1 radar)

key parameters:

• dimension of state vector of

dynamical model (plant)

• initial uncertainty in the state

vectorradars & 1 radar)

• discrimination for wideband

radar (BMD)

• linear systems

• quadratic nonlinearity

• cubic nonlinearity

• cosine nonlinearity

• Euler’s equations (6 DOF)

vector

• stability of the plant

• signal-to-noise ratio of

measurements

• trajectory of objects

• radar-object geometry

d = 12, ny = 3, y = x2, SNR = 20dB

quadratic measurement nonlinearity

Number of Particles

particle filter

d = 12, ny = 3, y = x

3, SNR = 20dB

cubic measurement nonlinearity

Number of Particles

particle filter

0 5 10 15 20 25 30 35 400

300N = 1000

nonlinear measurement (sine function of θ)

0 5 10 15 20 25 30 35 40

0 5 10 15 20 25 30 35 400

Euler’s equations of rotational motion

with wideband range measurements

median

error in

vector

new filter

0 2 4 6 8 1010

Time (sec)

PF Incompressible

PF Ax+B

N = 500 particles

50 Monte Carlo runs

new particle filter

new filter

improves

accuracy by two

orders of

magnitude

N = 100, σσσσr = 100m

Inexact Flow

Exact Flow

Exact Flow with Redraw

radar tracking ballistic missile

(d =6 & N = 100 particles)

incompressible

0 20 40 60 80 10010

Time (sec)

median range estimation error (angles only data)

0 50 100 150 200 250 300 350 40010

Time (sec)

EKF: solid

Log Homotopy PF: dashed

angle = 90 deg

angle = 75 deg

angle = 25 deg

angle = 10 deg

both EKF & PF

use modified spherical

coordinates

angle = 10 deg

median range estimation error (angles only data)

extended Kalman

filter

0 50 100 150 200 250 300 350 40010

Time (sec)

both EKF & PF

use modified spherical

coordinates

exact flow filter

3D position error, angle 1 sigma = 0.1 mrad

Radar separation = 0 m

1000 m

two radars air targets angles only

0 20 40 60 80 100 12010

Time (sec)

excellent

accuracy

from two

radars

2000 m

5000 m

10000 mexcellent

accuracy

from two

radars

0 20 40 60 80 100 12010

Time (sec)

1000 m

0 20 40 60 80 100 12010

Time (sec)

excellent

accuracy

from two

radars

Euler’s equations of rotational motion

with wideband range measurements

median

error in

vector

new filter

0 2 4 6 8 1010

Time (sec)

PF Incompressible

PF Ax+B

N = 500 particles

50 Monte Carlo runs

new particle filter

new filter

improves

accuracy by two

orders of

magnitude

Stable, d = 30, Linear

Inexact Flow

Exact Flow

only need N = 100

particles for optimal

accuracy for d = 30

Number of Particles

accuracy for d = 30

dimensional problem

exact flow: performance vs. number of particles

λλλλ = 1.2, Linear, Large Initial Uncertainty

Dimension = 5

30extremely

unstable

Number of Particles

item exact particle flow Monge-Kantorovich

optimal transport

1. purpose fix particle degeneracy due

to Bayes’ rule

move physical objects with

minimal effort from p1 to p2

2. conservation of

probability mass along flow

yes yes

3. deterministic yes* yes

4. homotopy of density no yes

5. log-homotopy of

density

yes no

6. optimality criteria none minimum action, etc.6. optimality criteria none minimum action, etc.

7. how to pick a solution a dozen distinct methods minimum action, etc.

8. stability of flow explicitly

considered

yes rarely

9. high dimensional

applications solved

successfully

yes (d ≤ 30) no (d = 1, 2 or 3)

10. computational

complexity

numerical integration of

ODE for each particle

solution of Monge-Ampere

nonlinear PDE or other PDE

11. solution of PDE for nice

special cases

many (incompressible,

irrotational, Gaussian, etc.)

only irrotational94

fast Ewald’s method vs. Coulomb’s law

item fast Ewald method

in physics &

chemistry*

Coulomb’s law

with fast k-NN

comments

1. dimension of x d = 3 d = 3 to 30 rapid decay of

Coulomb kernel in

higher d helps!

2. relative error

desired

0.0001 or better 1% to 10% all Ewald methods the

same for 1% accuracy

3. cut-off in x space fixed distance random per k-NN automatic space-taper

to weight convolution

4. desired force on mesh at particles big difference!

5. neutral charge locally enforced locally enforced crucial

6. smoothing charge Gaussian no explicit smoothing

7. k-space or real-

both real space (x) no FFT needed for

Coulomb

*Shan, Klepeis, Eastwood, Dror & Shaw, “Gaussian split Ewald: a fast Ewald mesh

method for molecular simulation,” Journal of Chemical Physics, 2005.

new exact particle flow: operator-valued homotopy

Improved version of incompressible flow (does not assume that div(f) = 0 and makes no other assumptions about the problem)

Avoids computing p(x,λ) or dividing by p (unlike Coulomb’s law), but rather it uses the gradient of the log of p(x,λ)

Does not depend on EKF or UKF

Completely general (highly non-Gaussian multimodal densities and compressible flow)

Compute the flow dx/dλ using the generalized inverse of the sum of two linear differential operators (A + B) using an ODE derived with a homotopy (similar to matrix inversion using ODE derived by homotopy); exploits easy computation of generalized inverse of A; A = gradient of log p(x,λ) as in incompressible flow & B = divergence

Inspired by: (1) Feynman’s perturbation approximation for QED but we do not approximate by exploiting small parameter 1/137 (see Kaku‘s book on QFT page 137) and (2) homotopy for matrix inverse and (3) generalized inverse of sum of two matrices in certain special cases (see page 50 in Campbell & Meyer book), as well as (4) Gromov’s h-principle; but our algorithm is new & distinct from these methods.

d = 12, ny = 3, y = x

3, SNR = 20dB

or cubic measurement

nonlinearity

Number of Particles

finite dimensional parametric approximation:

logplog(h)J

)b()xA()f(x,let

rATrbAx

rfdivfx

+++∂

λλλ

donly toreduced becan thisA sparse

forbut ,d is complexity nalcomputatio however, )5(

)(Tr(A) div(f) that note (4)

stablerobustly flow make penalty to add also could (3)

flow ofstability force to-BBAlet could (2)

xparticleeach at J minimize tob & Afor solve )1(

A∑==

:flow)exact our for usual (as EKFan usingGaussian

h& gith solution wexact our from computed are b &A in which

)c(x,)b()xA()f(x,let ,generality of lossWithout

)(log)(x

)logp(x,

:PDE following thesatisfies that )f(x, flow a find want toWe

xhfdivf

−=+∂

λλλλ

hybrid method:

loglog

:(.) of inverse-pseudo thedenotes (.)in which

)log()()(log

:)c(x, norm minimum thecompute and 0,div(c) that Assume

)log()()()(x

hATrbAxx

hcdivATrcbAx

∂−≈

−=++++∂

recall derivation of incompressible flow:

∂−−=−

)(log)log(

∂−=

)(loglog

:inverse-pseudo theusing ffor solve and

0,div(f) ible,incompress is flow that theassume

recall incompressible particle flow:

log)(log)(log

−−

gradient zerofor 0

new derivation of compressible flow:

∂−−=−

)(log)log(

∂−=

Kdhdiv

)(loglog

:inverse-pseudo theusing ffor solve and

ible,incompress is flow that theassumenot willwe

particle flow filter

• orders of magnitude faster than standard particle filters

• orders of magnitude more accurate than the extended

Kalman filter for difficult nonlinear problems

• solves particle degeneracy problem using particle flow

induced by log-homotopy for Bayes’ rule

• no resampling of particles• no resampling of particles

• no proposal density

• no importance sampling & no MCMC methods

• unnormalized log probability density

• embarrassingly parallelizable w/o resampling bottleneck

(unlike other particle filters)

• exploits smoothness & regularity of densities

effect of divergence of f:

log)()log(

pfdivh

∂−−=

= λλ

suppose

that div(f) is

[ ])(loglog

fdivhx

∂−=

effect of div(f)

is to change

the speed of

particle flow

if div(f) = 0

we get

incompressible

flow104

log)log(

∂−

gradient zerofor 0

∂ ∂=

angles only tracking

d = 6 and N = 500 particles

exact flow filter

0 50 100 150 200 250 300 350 40010

Time (sec)

EKF: Solid

Log Homotopy PF: Dashed

Tincknell Angle = 90 deg

median range error

(100 Monte Carlo runs)

Tincknell angle = 90 deg

exact flow filter

0 50 100 150 200 250 300 350 40010

Time (sec)

Log Homotopy PF

median range error

exact flow filter

0 50 100 150 200 250 300 350 40010

Time (sec)

Log Homotopy PF

median range error

exact flow filter

0 50 100 150 200 250 300 350 40010

Time (sec)

Log Homotopy PF

median range error

exact flow filter

0 50 100 150 200 250 300 350 40010

Time (sec)

Log Homotopy PF

median range error

exact particle flow for Gaussian densities:

pfdivh

∂−−=

log)()log(

),( λλ

( ) ( )[ ]xAzRPHAIAIb

HRHPHPHA

automatically stable

under very mild

conditions &

extremely fast

0.8Inside = 5 percent, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

0.8Inside = 10.6 percent, Magenta: truth, Green: PF estimate, Black: KF

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

log)log(

∂−

gradient zerofor 0

∂ ∂=

Hessia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

Hessia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

Hessia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

Hessia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

Hessia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

Hessia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

Hessia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

Hessia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

Hessia

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6-0.8

Hessia

variation in SNR

N = 1000, Stable, d = 10, Linear

SNR = 10 dB

, λλλλ = 0.6

0 5 10 15 20 25 3010

variation in process noise

N = 1000, Stable, d = 10, Quadratic

σσσσq = 0.01X

, λλλλ = 0.6

0 5 10 15 20 25 3010

particle flow filter

• orders of magnitude faster than standard particle filters

• orders of magnitude more accurate than the extended

Kalman filter for difficult nonlinear problems

• solves particle degeneracy problem using particle flow

induced by log-homotopy for Bayes’ rule

• no resampling of particles• no resampling of particles

• no proposal density

• no importance sampling & no MCMC methods

• unnormalized log probability density

• embarrassingly parallelizable w/o resampling bottleneck

(unlike other particle filters)

• exploits smoothness & regularity of densities

fundamental PDE for exact particle flow:

pfTrxp

∂−=

),(log

Fokker-Planck

equation with Q = 0

assume

log-homotopy

pfdivh

fTrxpxpxh

xhxgxp

∂−−=

∂−

∂−=

log)()log(

),(),()(log

)(log)(log),(log

log-homotopy

first order linear

underdetermined

PDE in f(x,λ)135

direct integration of fundamental PDE:

),(),(

:PDE theof form divergence theuse

∂−=

∑≠

ληλ

xqTrxqdiv

pick for best

stability of

particle flow

.0(x)dx iff

exists )q(x,for solution a , and on conditions regularity Assuming

)conditionsity compatibilfor (except function arbitrary )(

)()()(

∂∂

dxxxxq

particle flow

more details of direct integration:

∫ ∫Ω

such thatfunction arbitrary )(x

in which

2kfor )()()(

ηρη

kkk dxdxxxxq

0(x)dx that is q(x)solution a of

existence for thecondition sufficient &necessary a

set,smooth connected open, bounded, is and

support,compact with functionssmooth assuming

ρ kdx

Oh’s Formula for Monte Carlo errors

Assumptions:

(1) Gaussian density (zero mean & unit covariance matrix)

(2) d-dimensional random variable

(3) Proposal density is also Gaussian with mean ε and covariance matrix kI, but it is not exact for k ≠ 1 or ε ≠ 0

(4) N = number of Monte Carlo trials

nonlinear filtering problem

x = d-dimensional state vector

t = time

w(t) = process noise vector

dx = F(x, t)dt + G(x, t) dwcontinuous

dynamicsdiscrete

measurements

z(tk) = m-dimensional measurement vector

tk = time of kth measurement

vk = measurement noise vector

p(x, t | Zk) = probability density of x at time t given Zk

Zk = set of all measurements up to & including time tk

z(tk) = H(x(tk), tk, vk)

measurements

difficulties for exact finite dimensional filters

vs. particle filters

Bayes’ update of

conditional density

prediction of

conditional density

of x with time

1. exact filters

(e.g., Daum 1986)

easy hard

(e.g., Daum 1986)

2. particle filters hard easy

3. hybrid of exact

& particle filters

What is a particle filter?

Prediction of

conditional

probability

density from

tk-1 to tk

Update conditional

probability density

particles particles

Particle Filter

Probability

density is

represented

by particles

measurements

Importance sampling

from proposal density

(Monte Carlo or QMC

sampling)

probability density

using current

measurement

& Bayes’ rule

particles

or QMC

simulation

of dynamics

/sec)N = 100, σσσσ

r = 100m

Inexact Flow

Exact Flow

radar tracking ballistic missile

(d =6 & N = 100 particles)

incompressible

0 20 40 60 80 10010

Time (sec)

method to solve PDE how to pick unique solution computation

1. generalized inverse of linear differential

operator

minimum norm* Coulomb’s law or fast Poisson solver

2. Poisson’s equation gradient of potential*

(assume irrotational flow)

Coulomb’s law or fast Poisson solver

3. generalized inverse of gradient of log-

homotopy

assume incompressible flow (i.e.,

divergence free flow)

fast (but need to compute the gradient

of logp(x, λ) from random points)

4. most general solution most robustly stable filter or random

pick, etc.

fast (but need to compute the gradient

of logp(x, λ) from random points)

5. separation of variables (Gaussian) pick solution of specific form

(polynomial)

extremely fast (formula for flow)

6. separation of variables

(exponential family)

pick solution of specific form (finite basis

functions)

very fast (formula for flow)

7. variational formulation (Gauss & Hertz) convex function minimization ODEs

8. optimal control formulation convex functional minimization (e.g.,

least action like Monge-Kantorovich)

HJB PDE or Euler-Lagrange PDEs

(or maybe ODES for nice problem?)

9. direct integration (of first order linear PDE in

divergence form)

choice of d-1 arbitrary functions one-dimensional integral

10. generalized method of characteristics more conditions (e.g., curl = given &

chain rule)

ODEs from chain rule

11. another homotopy (inspired by Gromov’s

h-principle) like Feynman’s QED perturbation

initial condition of ODE &

uniqueness of sol. to ODE

ODEs from homotopy

12. finite dimensional parametric flow

(e.g., f = Ax+b with A & b parameters)

non-singular matrix to invert d³ or d^6 (least squares for d or d² parameters, i.e., A & b)

13. Fourier transform of PDE (divergence form

of linear PDE has constant coefficients!)

minimum norm* or most stable flow Coulomb’s law or fast Poisson solver144

(1) Flavia Lanzara , Vladimir Maz’ya & Gunther Schmidt

“On the fast computation of high dimensional volume

potentials” arXiv:0911.0443v1 [math.NA] 2 Nov 2009

note: linear computational complexity in d for uniform grid,

and it can be extended to scattered data!

(2) huge literature on fast Poisson solvers (e.g., FMM

Rokhlin, Beylkin, Coifman, Hackbush, et al.)

particle flow for nonlinear filters

Documents

Nonlinear Data Assimilation and Particle Filters Peter Jan.....

Particle Filters and Their Applications

HMMs and Particle Filters

Particle Filters and Their Applications -...

Particle Filters en Niet-Lineair Schatten

AN OVERVIEW OF EFFICIENT NONLINEAR...

Interacting and Annealing Particle Filters: Mathematics...

Kalman Filters for Nonlinear Systems

Introduction to Particle Filters for Data...

Particle Filters for Rover Fault Diagnosis

Nonlinear Regularized Reaction-Diﬀusion Filters for ...

Particle Filters - Washington...

Lecture 04: particle filters + Kalman...

Complex Valued Nonlinear Adaptive Filters

CRF-Filters: Discriminative Particle Filters for Sequential....

Bayesian filtering: From Kalman filters to particle filters....