Top Banner
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots of help also from: Steven Fletcher, Laura Fowler, Tarendra Lakhankar, Scott Longmore, Manajit Sengupta, Tom Vonder Haar, Dusanka Zupanski, and Milija Zupanski
73

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

Dec 21, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20081

What is Data Assimilation?A Tutorial

Andrew S. Jones

Lots of help also from:

Steven Fletcher, Laura Fowler, Tarendra Lakhankar, Scott

Longmore, Manajit Sengupta, Tom Vonder Haar, Dusanka

Zupanski, and Milija Zupanski

Page 2: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20082

Data Assimilation

Outline Why Do Data Assimilation? Who and What Important Concepts Definitions Brief History Common System Issues / Challenges

Page 3: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20083

The Purpose of Data Assimilation

Why do data assimilation?

Page 4: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20084

The Purpose of Data Assimilation

Why do data assimilation? (Answer: Common Sense)

Page 5: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20085

The Purpose of Data Assimilation

Why do data assimilation? (Answer: Common Sense)

MYTH: “It’s just an engineering tool”

Page 6: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20086

The Purpose of Data Assimilation

Why do data assimilation? (Answer: Common Sense)

MYTH: “It’s just an engineering tool”

If Truth matters,

“It’s our most important science tool”

Page 7: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20087

The Purpose of Data Assimilation

Why do data assimilation?

1. I want better model initial conditions for better model forecasts

Page 8: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20088

The Purpose of Data Assimilation

Why do data assimilation?

1. I want better model initial conditions for better model forecasts

2. I want better calibration and validation (cal/val)

Page 9: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20089

The Purpose of Data Assimilation

Why do data assimilation?

1. I want better model initial conditions for better model forecasts

2. I want better calibration and validation (cal/val)

3. I want better acquisition guidance

Page 10: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200810

The Purpose of Data Assimilation

Why do data assimilation?

1. I want better model initial conditions for better model forecasts

2. I want better calibration and validation (cal/val)

3. I want better acquisition guidance

4. I want better scientific understanding of Model errors (and their probability distributions)

Data errors (and their probability distributions)

Combined Model/Data correlations

DA methodologies (minimization, computational optimizations, representation methods, various method approximations)

Physical process interactions

Page 11: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200811

Why do data assimilation?

1. I want better model initial conditions for better model forecasts

2. I want better calibration and validation (cal/val)

3. I want better acquisition guidance

4. I want better scientific understanding of Model errors (and their probability distributions)

Data errors (and their probability distributions)

Combined Model/Data correlations

DA methodologies (minimization, computational optimizations, representation methods, various method approximations)

Physical process interactions (i.e., sensitivities and feedbacks)Leads toward better future models

The Purpose of Data Assimilation

Page 12: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200812

Why do data assimilation?

1. I want better model initial conditions for better model forecasts

2. I want better calibration and validation (cal/val)

3. I want better acquisition guidance

4. I want better scientific understanding of Model errors (and their probability distributions)

Data errors (and their probability distributions)

Combined Model/Data correlations

DA methodologies (minimization, computational optimizations, representation methods, various method approximations)

Physical process interactions (i.e., sensitivities and feedbacks)Leads toward better future models

VIRTUOUS CYCLE

The Purpose of Data Assimilation

Page 13: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200813

The Data Assimilation Community

Who is involved in data assimilation? NWP Data Assimilation Experts NWP Modelers Application and Observation Specialists Cloud Physicists / PBL Experts / NWP Parameterization Specialists Physical Scientists (Physical Algorithm Specialists) Radiative Transfer Specialists Applied Mathematicians / Control Theory Experts Computer Scientists Science Program Management (NWP and Science Disciplines) Forecasters Users and Customers

Page 14: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200814

The Data Assimilation Community

What skills are needed by each involved group? NWP Data Assimilation Experts (DA system methodology) NWP Modelers (Model + Physics + DA system) Application and Observation Specialists (Instrument capabilities) Physical Scientists (Instrument + Physics + DA system) Radiative Transfer Specialists (Instrument config. specifications) Applied Mathematicians (Control theory methodology) Computer Scientists (DA system + OPS time requirements) Science Program Management (Everything + $$ + Good People) Forecasters (Everything + OPS time reqs. + Easy/fast access) Users and Customers (Could be a wide variety of responses)

e.g., NWS / Army / USAF / Navy / NASA / NSF / DOE / ECMWF

Page 15: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200815

The Data Assimilation Community

Are you part of this community?

Page 16: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200816

The Data Assimilation Community

Are you part of this community? Yes, you just may not know it yet.

Page 17: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200817

The Data Assimilation Community

Are you part of this community? Yes, you just may not know it yet.

Who knows all about data assimilation?

Page 18: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200818

The Data Assimilation Community

Are you part of this community? Yes, you just may not know it yet.

Who knows all about data assimilation? No one knows it all, it takes many experts

Page 19: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200819

The Data Assimilation Community

Are you part of this community? Yes, you just may not know it yet.

Who knows all about data assimilation? No one knows it all, it takes many experts

How large are these systems?

Page 20: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200820

The Data Assimilation Community

Are you part of this community? Yes, you just may not know it yet.

Who knows all about data assimilation? No one knows it all, it takes many experts

How large are these systems? Typically, the DA systems are “medium”-sized projects

using software industry standards Medium = multi-year coding effort by several individuals

(e.g., RAMDAS is ~230K lines of code, ~3500 pages of code) Satellite “processing systems” tend to be larger still

Our CIRA Mesoscale 4DVAR system was built over ~7-8 years with heritage from the ETA 4DVAR system

Page 21: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200821

The Building Blocks of Data Assimilation

NWP Model

Observations

NWPAdjoint

Minimization

Observation ModelAdjoint

Control Variablesare the initial model

state variables

that are optimized

using the new data

information as a guide

They can also include

boundary condition

information, model

parameters for

“tuning”, etc.

Observation Model

Page 22: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200822

What Are We Minimizing?

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J Minimize discrepancy between model and observation data over time

The Cost Function, J, is the link between the

observational data and the model variables

Observations are either assumed unbiased, or

are “debiased” by some adjustment method

Page 23: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200823

Bayes Theorem

Maximum Conditional Probability is given by:

P (x | y) ~ P (y | x) P (x)

Assuming Gaussian distributions…

P (y | x) ~ exp {-1/2 [y – H (x)]T R-1 [y – H (x)]}

P (x) ~ exp {-1/2 [x –xb]T B-1 [x – xb]}

e.g.,

3DVAR

Lorenc (1986)

Page 24: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200824

What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time

Model Background or

Observations?

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 25: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200825

What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time

Model Background or

Observations?

Trust = WeightingsJust like your financial credit score!

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 26: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200826

Who are the Candidates for “Truth”? Minimize discrepancy between model and observation data over time

Candidate 1: Background Term“x0” is the model state vector at the initial time

t0

this is also the “control variable”,

the object of the minimization process

“xb” is the model background state vector

“B” is the background error covarianceof the forecast and model errors

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 27: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200827

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

Candidate 2: Observational Term“y” is the observational vector, e.g., the satellite input data

(typically radiances), salinity, sounding profiles

“M0,i(x0)” is the model state at the observation time “i”

“h” is the observational operator, for example the“forward radiative transfer model”

“R” is the observational error covariance matrix that specifies the instrumental noise and data representation errors (currently assumed to be diagonal…)

Who are the Candidates for “Truth”? Minimize discrepancy between model and observation data over time

Page 28: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200828

What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time

Candidate 1: Background TermThe default condition for the assimilation when

1. data are not available or

2. the available data have no significant sensitivity to the model state or

3. the available data are inaccurate

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 29: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200829

Model Error Impacts our “Trust” Minimize discrepancy between model and observation data over time

Candidate 1: Background TermModel error issues are important

Model error varies as a function of the model time

Model error “grows” with time

Therefore the background term should be trusted more at the initial stages of the model run and trusted less at the end of the model run

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 30: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200830

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

How to Adjust for Model Error? Minimize discrepancy between model and observation data over time

Candidate 1: Background Term1. Add a model error term to the cost function so that the

weight at that specific model step is appropriately weighted or

2. Use other possible adjustments in the methodology, i.e., “make an assumption” about the model error impacts

If model error adjustments or controls are used the DA system is said to be “weakly constrained”

Page 31: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200831

What About Model Error Errors? Minimize discrepancy between model and observation data over time

Candidate 1: Background TermModel error adjustments to the weighting can be “wrong”

In particular, most assume some type of linearity

Non-linear physical processes may break these

assumptions and be more complexly interrelated

A data assimilation system with no model error control is said to

be “strongly constrained” (perfect model assumption)

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 32: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200832

What About other DA Errors?

Overlooked Issues?1. Data debiasing relative to the DA system

“reference”. It is not the “Truth”,however it is self-consistent.

2. DA Methodology Errors?1. Assumptions: Linearization, Gaussianity, Model

errors

2. Representation errors (space and time)

3. Poorly known background error covariances

4. Imperfect observational operators

5. Overly aggressive data “quality control”

6. Historical emphasis on dynamical impact vs. physical

Synoptic vs. Mesoscale?

Page 33: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200833

DA Theory is Still Maturing

The Future: Lognormal DA (Fletcher and Zupanski, 2006, 2007)

Gaussian systems typically force lognormal variables to become Gaussian introducing an avoidable data assimilation system bias

Many important variables

are lognormally distributed

Gaussian data assimilation system

variables are “Gaussian”

Add DA

Bias Here!

Lognormal

VariablesClouds

Precipitation

Water vapor

Emissivities

Many other

hydrologic

fields

Mode Mean

Page 34: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200834

What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time

Candidate 2: Observational TermThe non-default condition for the assimilation when

1. data are available and

2. data are sensitive to the model state and

3. data are precise (not necessarily “accurate”) and

4. data are not thrown away by DA “quality control” methods

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 35: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200835

What “Truth” Do We Have? Minimize discrepancy between model and observation data over time

DATA MODEL

CENTRIC CENTRIC

TRUTH

)x(xB)x(x2

1y))](x(h[MRy))](x(h[M

2

1 b0

1Tb00i0,

1T0i0,

(time) i

J

Page 36: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200836

DA Theory is Still Maturing

A Brief History of DA1. Hand Interpolation2. Local polynomial interpolation schemes

(e.g., Cressman)3. Use of “first guess”, i.e., a background4. Use of an “analysis cycle” to regenerate

a new first guess5. Empirical schemes, e.g., nudging6. Least squares methods

1. Variational DA (VAR)2. Sequential DA (KF)3. Monte Carlo Approx. to Seq. DA (EnsKF)

Page 37: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200837

Variational Techniques

Finds the maximum likelihood (if Gaussian, etc.)

(actually it is a minimum variance method)

Comes from setting the gradient of the cost function equal to zero

Control variable is xa

Major Flavors: 1DVAR (Z), 3DVAR (X,Y,Z), 4DVAR (X,Y,Z,T)

Lorenc (1986) and others…

Became the operational scheme in early 1990s to the present day

xH

xyRHHRHBxx

h

hTTba )]([)( 111

Page 38: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200838

Sequential Techniques

B is no longer static, B => Pf = forecast error covariance

Pa (ti) is estimated at future times using the model

K = “Kalman Gain” (in blue boxes)

Extended KF, Pa is found by linearizing the model about

the nonlinear trajectory of the model between ti-1 and ti

Kalman (1960) and many others…

These techniques can evolve the forecast error covariance fields

similar in concept to OI

)()}])([)({()(

)]}([}{])([)({)()(1

1

if

iTii

fii

Tii

fi

a

ifT

iif

iiTii

fi

fi

a

tttt

tHtttt

PHHPHRHPIP

xyHPHRHPxx

Page 39: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200839

Sequential Techniques

f is a particular forecast instance

l is the reference state forecast

Pf is estimated at future times using the model

K number model runs are required

(Q: How to populate the seed perturbations?)

Sampling allows for use of approximate solutions

Eliminates the need to linearize the model (as in Extended KF)

No tangent linear or adjoint models are needed

Ensembles can be used in KF-based sequential DA systems

Ensembles are used to estimate Pf through Gaussian “sampling” theory

K

lk

Tfl

fk

fl

fk

fl K

)()(2

1xxxxP

Page 40: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200840

Sequential Techniques

Notes on EnsKF-based sequential DA systems

1. EnsKFs are an approximation

2. Underlying theory is the KF

3. Assumes Gaussian distributions

4. Many ensemble samples are required

5. Can significantly improve Pf

6. Where does H fit in? Is it fully “resolved”?

7. What about the “Filter” aspects?

Future Directions Research using Hybrid EnsKF-Var techniques

Page 41: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200841

Sequential Techniques

NE is the number of ensembles

S is the state-space dimension

Each ensemble is carefully selected to represent thedegrees of freedom of the system

Square-root filter is built-in to the algorithm assumptions

Zupanski (2005): Maximum Likelihood Ensemble Filter (MLEF)

Structure function version of Ensemble-based DA

(Note: Does not use sampling theory, and is

more similar to a variational DA scheme using

principle component analysis (PCA)

][ 2121 f

Nff

f EpppP

fiS

fi

fi

fi

p

p

p

,

,2

,1

p

Page 42: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200842

Where is “M” in all of this?

3DDA Techniques have no explicit model time tendency information, it is all done implicitly with cycling techniques, typically focusing only on the Pf term

4DDA uses M explicitly via the model sensitivities, L, and model adjoints, LT,as a function of time

Kalman Smoothers (e.g., also 4DEnsKS) would likewise also need to estimate L and LT

No M

used

M used

Page 43: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200843

4DVAR Revisited(for an example see Poster NPOESS P1.16 by Jones et al.)

LT is the adjoint which is integrated from ti to t0

Adjoints are NOT the “model running in reverse”,

but merely the model sensitivities being integrated

in reverse order, thus all adjoints appear to function

backwards. Think of it as accumulating the

“impacts” back toward the initial control variables.

Automatically propagates the Pf within the cycle, however can not save the result for the next analysis cycle (memory of “B” info becomes lost in the next cycle) (Thepaut et al., 1993)

)(

})]([{

00

0,01

0

0,0,01

0

t

M

h

MhJ

ii

ii

iiiTi

N

i

Tib

xx

xL

xH

yxRHLxxBx

Page 44: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200844

Minimization Process

TRUTH

Jacobian of the Cost Function is used in the minimization procedure

Minima is at J/ x = 0

Issues:

Is it a global minima?

Are we converging rapidor slow?

J

x

Page 45: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200845

obs1

obs2

xtime

Geographically distant observations can bring more information than close-by observations, if in a dynamically significant region

grid-point

Ensembles: Flow-dependent forecast error covariance and spread of information from observations

t0

t1

t2

Isotropic

correlations

From M. Zupanski

Page 46: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200846

J=const.

min

x0

xmin

J=const.

Physical space

Preconditioning space

-g

-gx

Preconditioning the Space

“Preconditioners” transform the

variable space so that fewer iterations

are required while minimizing the cost function

x ->

Result: faster convergence

From M. Zupanski

Page 47: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200847

Incremental VARCourtier et al. (1994)

Most Common 4D framework in operational use

Incremental form performs Linear minimization within a lower dimensional space (the inner loop minimization)

Outer loop minimization is at the full model resolution(non-linear physics are added back in this stage)

Benefits:

Smoothes the cost function and assures betterminimization behaviors

Reduces the need for explicit preconditioning

Issues: Extra linearizations occur It is an approximate form of VAR DA

Page 48: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200848

Types of DA Solution Spaces

1. Model Space (x)

2. Physical Space (y)

3. Ensemble Sub-spacee.g., Maximum Likelihood Ensemble Filter (MLEF)

Types of Ensemble Kalman Filters1. Perturbed observations (or stochastic)

2. Square root filters (i.e., analysis perturbations are obtained from the Square root of the Kalman Filter analysis covariance)

Page 49: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200849

How are Data used in Time?

Assimilation time window

observations

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

time

forecast

Page 50: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200850

Assimilation time window

observationstime

forecast

A “Smoother” Uses All Data Availablein the Assimilation Window

(a “Simultaneous” Solution)

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

Page 51: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200851

Assimilation time window

observationstime

forecast

A “Filter” Sequentially Assimilates Dataas it Becomes Available in each Cycle

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

Page 52: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200852

Assimilation time window

observationstime

forecastCycle Previous

Information

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

A “Filter” Sequentially Assimilates Data

as it Becomes Available in each Cycle

Page 53: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200853

Assimilation time window

observationstime

forecastCycle Previous

Information

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

A “Filter” Sequentially Assimilates Data

as it Becomes Available in each Cycle

Page 54: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200854

Cycle Physics “Barriers”What Can Overcome the Barrier?1. Linear Physics Processes and2. Propagated Forecast Error Covariances

time

forecast

)(ε)(xx

ε)](x[y

x01-i0,i

y0i0,

GM

Mh

Observation model

Cloud resolving model

A “Filter” Sequentially Assimilates Data

as it Becomes Available in each Cycle

Page 55: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200855

Data AssimilationConclusions Broad, Dynamic,

Evolving, Foundational Science Field!

Flexible unified frameworks, standards, and funding will improve training and education

Continued need for advanced DA systemsfor research purposes(non-OPS)

Can share OPS framework components,e.g., JCSDA CRTM

Data Assimilation

Thanks! ([email protected])

For more information…Great NWP DA Review Paper (By Mike Navon)

ECMWF DA training materials

JCSDA DA workshophttp://people.scs.fsu.edu/~navon/pubs/JCP1229.pdf

http://www.ecmwf.int/newsevents/training/rcourse_notes/

http://www.weatherchaos.umd.edu/workshop/

Page 56: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200856

Backup Slides

Or: Why Observationalists and Modelers see things differently…

No offense meant for either side

Page 57: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200857

What Approach Should We Use?

DATA MODEL

CENTRIC CENTRIC

TRUTH

Page 58: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200858

What Approach Should We Use?

DATA

MODEL

CENTRIC CENTRIC

TRUTH

Page 59: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200859

My My PrecioPrecious…us…

We Trust the We Trust the Model!Model!

Data hurts Data hurts us!,us!,

Yes…Yes…

What Approach Should We Use?

DATA

MODEL

CENTRIC CENTRIC

TRUTH

Page 60: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200860

MODEL CENTRIC FOCUS

DATA MODEL

CENTRIC CENTRIC

FOCUS ON“B” Background Error

Improvements are Needed

“xb” Associated background states and “Cycling” are more heavily emphasized

DA method selection tends toward sequential estimators, “filters”, and improved representation of the forecast model error covariances

E.g., Ensemble Kalman Filters,

other Ensemble Filter systems

Page 61: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200861

What Approach Should We Use?

DATA MODEL

CENTRIC CENTRIC

TRUTH

This is not to say that

all model-centric improvements

are bad…

Page 62: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200862

What Approach Should We Use?

DATA

MODEL

CENTRIC

CENTRIC

TRUTH

Page 63: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200863

My Precious…My Precious…

We Trust the Data!We Trust the Data!

Models unfair and Models unfair and hurts us!,hurts us!,

Yes…Yes…

What Approach Should We Use?

DATA

MODEL

CENTRIC

CENTRIC

TRUTH

Page 64: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200864

DATA CENTRIC FOCUS

DATA DATA

CENTRIC CENTRIC

FOCUS ON“h” Observational Operator

Improvements are Needed

“M0,i(x0)” Model state capabilities and independent experimental validation is more heavily emphasized

DA method selection tends toward “smoothers” (less focus on model cycling), more emphasis on data quantity and improvements in the data operator and understanding of data representation errors e.g., 4DVAR systems

Page 65: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200865

DUAL-CENTRIC FOCUS

Best of both worlds?

Solution: Ensemble based forecast covariance estimates combined with 4DVAR smoother for research and 4DVAR filter for operations?

Several frameworks to combine the two approaches are in various stages of development now…

Page 66: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200866

What Have We Learned?

DATA MODEL

CENTRIC CENTRIC

TRUTH

Your Research Objective is CRITICAL to making the right choices…

1. Operational choices may supercede good research objectives

2. Computational speed is always critical for operational purposes

3. Accuracy is critical for research purposes

Page 67: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200867

What About Model Error Errors?

A Strongly

Constrained

System?

Can Data Over

Constrain?

““I just can’t run I just can’t run like I used to.”like I used to.”

Model“Little Data People”

Page 68: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200868

What About Model Error Errors?

A Strongly

Constrained

System?

Can Data Over

Constrain?

““We’ll… no one’s We’ll… no one’s perfect.”perfect.”

DA expert

Page 69: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200869

Optimal Interpolation (OI)

)]([a bb h xyWxx

OI merely means finding the “optimal” Weights, W

Eliassen (1954), Bengtsson et al. (1981), Gandin (1963)

Became the operational scheme in early 1980s and early 1990s

1)( RHBHBHW TT

A better name would have been “statistical interpolation”

Page 70: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200870

What is a Hessian?

A Rank-2 Square Matrix

Containing the Partial Derivatives of the Jacobian

G(f)ij(x) = DiDj f(x)

The Hessian is used in some minimization methods,e.g., quasi-Newton…

)()( 11 nnnn ff xxGxx

Page 71: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200871

The Role of the Adjoint, etc.Adjoints are used in the cost function minimization procedure

But first…

Tangent Linear Models are used to approximate the non-linear model behaviors

L x’ = [M(x1) – M(x2)] / L is the linear operator of the perturbation model

M is the non-linear forward model is the perturbation scaling-factorx2 = x1 + x’

Page 72: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200872

Useful Properties of the Adjoint

<Lx’, Lx’> <LTLx’, x’> LT is the adjoint operator of the perturbation model

Typically the adjoint and the tangent linear operator can be automatically created using automated compilers

y = (x1, …, xn, y)

*xi = *xi + *y /xi

*y = *y /y where *xi and *y are

the “adjoint” variables

Page 73: DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.

DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200873

Useful Properties of the Adjoint

<Lx’, Lx’> <LTLx’, x’> LT is the adjoint operator of the perturbation model

Typically the adjoint and the tangent linear operator can be automatically created using automated compilers

Of course, automated methods fail for complex variable types

(See Jones et al., 2004)

E.g., how can the compiler know when the variable is complex, when codes are decomposed into real and imaginary parts as common practice? (It can’t.)