DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots of help also from: Steven Fletcher, Laura Fowler, Tarendra Lakhankar, Scott Longmore, Manajit Sengupta, Tom Vonder Haar, Dusanka Zupanski, and Milija Zupanski
73
Embed
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 2008 1 What is Data Assimilation? A Tutorial Andrew S. Jones Lots.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20081
What is Data Assimilation?A Tutorial
Andrew S. Jones
Lots of help also from:
Steven Fletcher, Laura Fowler, Tarendra Lakhankar, Scott
Longmore, Manajit Sengupta, Tom Vonder Haar, Dusanka
Zupanski, and Milija Zupanski
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20082
Data Assimilation
Outline Why Do Data Assimilation? Who and What Important Concepts Definitions Brief History Common System Issues / Challenges
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20083
The Purpose of Data Assimilation
Why do data assimilation?
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20084
The Purpose of Data Assimilation
Why do data assimilation? (Answer: Common Sense)
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20085
The Purpose of Data Assimilation
Why do data assimilation? (Answer: Common Sense)
MYTH: “It’s just an engineering tool”
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20086
The Purpose of Data Assimilation
Why do data assimilation? (Answer: Common Sense)
MYTH: “It’s just an engineering tool”
If Truth matters,
“It’s our most important science tool”
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20087
The Purpose of Data Assimilation
Why do data assimilation?
1. I want better model initial conditions for better model forecasts
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20088
The Purpose of Data Assimilation
Why do data assimilation?
1. I want better model initial conditions for better model forecasts
2. I want better calibration and validation (cal/val)
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 20089
The Purpose of Data Assimilation
Why do data assimilation?
1. I want better model initial conditions for better model forecasts
2. I want better calibration and validation (cal/val)
3. I want better acquisition guidance
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200810
The Purpose of Data Assimilation
Why do data assimilation?
1. I want better model initial conditions for better model forecasts
2. I want better calibration and validation (cal/val)
3. I want better acquisition guidance
4. I want better scientific understanding of Model errors (and their probability distributions)
Data errors (and their probability distributions)
Combined Model/Data correlations
DA methodologies (minimization, computational optimizations, representation methods, various method approximations)
Physical process interactions
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200811
Why do data assimilation?
1. I want better model initial conditions for better model forecasts
2. I want better calibration and validation (cal/val)
3. I want better acquisition guidance
4. I want better scientific understanding of Model errors (and their probability distributions)
Data errors (and their probability distributions)
Combined Model/Data correlations
DA methodologies (minimization, computational optimizations, representation methods, various method approximations)
Physical process interactions (i.e., sensitivities and feedbacks)Leads toward better future models
The Purpose of Data Assimilation
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200812
Why do data assimilation?
1. I want better model initial conditions for better model forecasts
2. I want better calibration and validation (cal/val)
3. I want better acquisition guidance
4. I want better scientific understanding of Model errors (and their probability distributions)
Data errors (and their probability distributions)
Combined Model/Data correlations
DA methodologies (minimization, computational optimizations, representation methods, various method approximations)
Physical process interactions (i.e., sensitivities and feedbacks)Leads toward better future models
VIRTUOUS CYCLE
The Purpose of Data Assimilation
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200813
The Data Assimilation Community
Who is involved in data assimilation? NWP Data Assimilation Experts NWP Modelers Application and Observation Specialists Cloud Physicists / PBL Experts / NWP Parameterization Specialists Physical Scientists (Physical Algorithm Specialists) Radiative Transfer Specialists Applied Mathematicians / Control Theory Experts Computer Scientists Science Program Management (NWP and Science Disciplines) Forecasters Users and Customers
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200814
The Data Assimilation Community
What skills are needed by each involved group? NWP Data Assimilation Experts (DA system methodology) NWP Modelers (Model + Physics + DA system) Application and Observation Specialists (Instrument capabilities) Physical Scientists (Instrument + Physics + DA system) Radiative Transfer Specialists (Instrument config. specifications) Applied Mathematicians (Control theory methodology) Computer Scientists (DA system + OPS time requirements) Science Program Management (Everything + $$ + Good People) Forecasters (Everything + OPS time reqs. + Easy/fast access) Users and Customers (Could be a wide variety of responses)
e.g., NWS / Army / USAF / Navy / NASA / NSF / DOE / ECMWF
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200815
The Data Assimilation Community
Are you part of this community?
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200816
The Data Assimilation Community
Are you part of this community? Yes, you just may not know it yet.
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200817
The Data Assimilation Community
Are you part of this community? Yes, you just may not know it yet.
Who knows all about data assimilation?
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200818
The Data Assimilation Community
Are you part of this community? Yes, you just may not know it yet.
Who knows all about data assimilation? No one knows it all, it takes many experts
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200819
The Data Assimilation Community
Are you part of this community? Yes, you just may not know it yet.
Who knows all about data assimilation? No one knows it all, it takes many experts
How large are these systems?
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200820
The Data Assimilation Community
Are you part of this community? Yes, you just may not know it yet.
Who knows all about data assimilation? No one knows it all, it takes many experts
How large are these systems? Typically, the DA systems are “medium”-sized projects
using software industry standards Medium = multi-year coding effort by several individuals
(e.g., RAMDAS is ~230K lines of code, ~3500 pages of code) Satellite “processing systems” tend to be larger still
Our CIRA Mesoscale 4DVAR system was built over ~7-8 years with heritage from the ETA 4DVAR system
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200821
The Building Blocks of Data Assimilation
NWP Model
Observations
NWPAdjoint
Minimization
Observation ModelAdjoint
Control Variablesare the initial model
state variables
that are optimized
using the new data
information as a guide
They can also include
boundary condition
information, model
parameters for
“tuning”, etc.
Observation Model
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200822
What Are We Minimizing?
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J Minimize discrepancy between model and observation data over time
The Cost Function, J, is the link between the
observational data and the model variables
Observations are either assumed unbiased, or
are “debiased” by some adjustment method
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200823
Bayes Theorem
Maximum Conditional Probability is given by:
P (x | y) ~ P (y | x) P (x)
Assuming Gaussian distributions…
P (y | x) ~ exp {-1/2 [y – H (x)]T R-1 [y – H (x)]}
P (x) ~ exp {-1/2 [x –xb]T B-1 [x – xb]}
e.g.,
3DVAR
Lorenc (1986)
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200824
What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time
Model Background or
Observations?
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200825
What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time
Model Background or
Observations?
Trust = WeightingsJust like your financial credit score!
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200826
Who are the Candidates for “Truth”? Minimize discrepancy between model and observation data over time
Candidate 1: Background Term“x0” is the model state vector at the initial time
t0
this is also the “control variable”,
the object of the minimization process
“xb” is the model background state vector
“B” is the background error covarianceof the forecast and model errors
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200827
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
Candidate 2: Observational Term“y” is the observational vector, e.g., the satellite input data
“M0,i(x0)” is the model state at the observation time “i”
“h” is the observational operator, for example the“forward radiative transfer model”
“R” is the observational error covariance matrix that specifies the instrumental noise and data representation errors (currently assumed to be diagonal…)
Who are the Candidates for “Truth”? Minimize discrepancy between model and observation data over time
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200828
What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time
Candidate 1: Background TermThe default condition for the assimilation when
1. data are not available or
2. the available data have no significant sensitivity to the model state or
3. the available data are inaccurate
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200829
Model Error Impacts our “Trust” Minimize discrepancy between model and observation data over time
Candidate 1: Background TermModel error issues are important
Model error varies as a function of the model time
Model error “grows” with time
Therefore the background term should be trusted more at the initial stages of the model run and trusted less at the end of the model run
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200830
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
How to Adjust for Model Error? Minimize discrepancy between model and observation data over time
Candidate 1: Background Term1. Add a model error term to the cost function so that the
weight at that specific model step is appropriately weighted or
2. Use other possible adjustments in the methodology, i.e., “make an assumption” about the model error impacts
If model error adjustments or controls are used the DA system is said to be “weakly constrained”
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200831
What About Model Error Errors? Minimize discrepancy between model and observation data over time
Candidate 1: Background TermModel error adjustments to the weighting can be “wrong”
In particular, most assume some type of linearity
Non-linear physical processes may break these
assumptions and be more complexly interrelated
A data assimilation system with no model error control is said to
be “strongly constrained” (perfect model assumption)
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200832
What About other DA Errors?
Overlooked Issues?1. Data debiasing relative to the DA system
“reference”. It is not the “Truth”,however it is self-consistent.
2. DA Methodology Errors?1. Assumptions: Linearization, Gaussianity, Model
errors
2. Representation errors (space and time)
3. Poorly known background error covariances
4. Imperfect observational operators
5. Overly aggressive data “quality control”
6. Historical emphasis on dynamical impact vs. physical
Synoptic vs. Mesoscale?
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200833
DA Theory is Still Maturing
The Future: Lognormal DA (Fletcher and Zupanski, 2006, 2007)
Gaussian systems typically force lognormal variables to become Gaussian introducing an avoidable data assimilation system bias
Many important variables
are lognormally distributed
Gaussian data assimilation system
variables are “Gaussian”
Add DA
Bias Here!
Lognormal
VariablesClouds
Precipitation
Water vapor
Emissivities
Many other
hydrologic
fields
Mode Mean
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200834
What Do We Trust for “Truth”? Minimize discrepancy between model and observation data over time
Candidate 2: Observational TermThe non-default condition for the assimilation when
1. data are available and
2. data are sensitive to the model state and
3. data are precise (not necessarily “accurate”) and
4. data are not thrown away by DA “quality control” methods
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200835
What “Truth” Do We Have? Minimize discrepancy between model and observation data over time
DATA MODEL
CENTRIC CENTRIC
TRUTH
)x(xB)x(x2
1y))](x(h[MRy))](x(h[M
2
1 b0
1Tb00i0,
1T0i0,
(time) i
J
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200836
DA Theory is Still Maturing
A Brief History of DA1. Hand Interpolation2. Local polynomial interpolation schemes
(e.g., Cressman)3. Use of “first guess”, i.e., a background4. Use of an “analysis cycle” to regenerate
a new first guess5. Empirical schemes, e.g., nudging6. Least squares methods
1. Variational DA (VAR)2. Sequential DA (KF)3. Monte Carlo Approx. to Seq. DA (EnsKF)
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200837
Variational Techniques
Finds the maximum likelihood (if Gaussian, etc.)
(actually it is a minimum variance method)
Comes from setting the gradient of the cost function equal to zero
Control variable is xa
Major Flavors: 1DVAR (Z), 3DVAR (X,Y,Z), 4DVAR (X,Y,Z,T)
Lorenc (1986) and others…
Became the operational scheme in early 1990s to the present day
xH
xyRHHRHBxx
h
hTTba )]([)( 111
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200838
Sequential Techniques
B is no longer static, B => Pf = forecast error covariance
Pa (ti) is estimated at future times using the model
K = “Kalman Gain” (in blue boxes)
Extended KF, Pa is found by linearizing the model about
the nonlinear trajectory of the model between ti-1 and ti
Kalman (1960) and many others…
These techniques can evolve the forecast error covariance fields
similar in concept to OI
)()}])([)({()(
)]}([}{])([)({)()(1
1
if
iTii
fii
Tii
fi
a
ifT
iif
iiTii
fi
fi
a
tttt
tHtttt
PHHPHRHPIP
xyHPHRHPxx
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200839
Sequential Techniques
f is a particular forecast instance
l is the reference state forecast
Pf is estimated at future times using the model
K number model runs are required
(Q: How to populate the seed perturbations?)
Sampling allows for use of approximate solutions
Eliminates the need to linearize the model (as in Extended KF)
No tangent linear or adjoint models are needed
Ensembles can be used in KF-based sequential DA systems
Ensembles are used to estimate Pf through Gaussian “sampling” theory
K
lk
Tfl
fk
fl
fk
fl K
)()(2
1xxxxP
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200840
Sequential Techniques
Notes on EnsKF-based sequential DA systems
1. EnsKFs are an approximation
2. Underlying theory is the KF
3. Assumes Gaussian distributions
4. Many ensemble samples are required
5. Can significantly improve Pf
6. Where does H fit in? Is it fully “resolved”?
7. What about the “Filter” aspects?
Future Directions Research using Hybrid EnsKF-Var techniques
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200841
Sequential Techniques
NE is the number of ensembles
S is the state-space dimension
Each ensemble is carefully selected to represent thedegrees of freedom of the system
Square-root filter is built-in to the algorithm assumptions
Zupanski (2005): Maximum Likelihood Ensemble Filter (MLEF)
Structure function version of Ensemble-based DA
(Note: Does not use sampling theory, and is
more similar to a variational DA scheme using
principle component analysis (PCA)
][ 2121 f
Nff
f EpppP
fiS
fi
fi
fi
p
p
p
,
,2
,1
p
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200842
Where is “M” in all of this?
3DDA Techniques have no explicit model time tendency information, it is all done implicitly with cycling techniques, typically focusing only on the Pf term
4DDA uses M explicitly via the model sensitivities, L, and model adjoints, LT,as a function of time
Kalman Smoothers (e.g., also 4DEnsKS) would likewise also need to estimate L and LT
No M
used
M used
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200843
4DVAR Revisited(for an example see Poster NPOESS P1.16 by Jones et al.)
LT is the adjoint which is integrated from ti to t0
Adjoints are NOT the “model running in reverse”,
but merely the model sensitivities being integrated
in reverse order, thus all adjoints appear to function
backwards. Think of it as accumulating the
“impacts” back toward the initial control variables.
Automatically propagates the Pf within the cycle, however can not save the result for the next analysis cycle (memory of “B” info becomes lost in the next cycle) (Thepaut et al., 1993)
)(
})]([{
00
0,01
0
0,0,01
0
t
M
h
MhJ
ii
ii
iiiTi
N
i
Tib
xx
xL
xH
yxRHLxxBx
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200844
Minimization Process
TRUTH
Jacobian of the Cost Function is used in the minimization procedure
Minima is at J/ x = 0
Issues:
Is it a global minima?
Are we converging rapidor slow?
J
x
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200845
obs1
obs2
xtime
Geographically distant observations can bring more information than close-by observations, if in a dynamically significant region
grid-point
Ensembles: Flow-dependent forecast error covariance and spread of information from observations
t0
t1
t2
Isotropic
correlations
From M. Zupanski
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200846
J=const.
min
x0
xmin
J=const.
Physical space
Preconditioning space
-g
-gx
Preconditioning the Space
“Preconditioners” transform the
variable space so that fewer iterations
are required while minimizing the cost function
x ->
Result: faster convergence
From M. Zupanski
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200847
Incremental VARCourtier et al. (1994)
Most Common 4D framework in operational use
Incremental form performs Linear minimization within a lower dimensional space (the inner loop minimization)
Outer loop minimization is at the full model resolution(non-linear physics are added back in this stage)
Benefits:
Smoothes the cost function and assures betterminimization behaviors
Reduces the need for explicit preconditioning
Issues: Extra linearizations occur It is an approximate form of VAR DA
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200848
Types of DA Solution Spaces
1. Model Space (x)
2. Physical Space (y)
3. Ensemble Sub-spacee.g., Maximum Likelihood Ensemble Filter (MLEF)
Types of Ensemble Kalman Filters1. Perturbed observations (or stochastic)
2. Square root filters (i.e., analysis perturbations are obtained from the Square root of the Kalman Filter analysis covariance)
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200849
How are Data used in Time?
Assimilation time window
observations
)(ε)(xx
ε)](x[y
x01-i0,i
y0i0,
GM
Mh
Observation model
Cloud resolving model
time
forecast
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200850
Assimilation time window
observationstime
forecast
A “Smoother” Uses All Data Availablein the Assimilation Window
(a “Simultaneous” Solution)
)(ε)(xx
ε)](x[y
x01-i0,i
y0i0,
GM
Mh
Observation model
Cloud resolving model
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200851
Assimilation time window
observationstime
forecast
A “Filter” Sequentially Assimilates Dataas it Becomes Available in each Cycle
)(ε)(xx
ε)](x[y
x01-i0,i
y0i0,
GM
Mh
Observation model
Cloud resolving model
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200852
Assimilation time window
observationstime
forecastCycle Previous
Information
)(ε)(xx
ε)](x[y
x01-i0,i
y0i0,
GM
Mh
Observation model
Cloud resolving model
A “Filter” Sequentially Assimilates Data
as it Becomes Available in each Cycle
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200853
Assimilation time window
observationstime
forecastCycle Previous
Information
)(ε)(xx
ε)](x[y
x01-i0,i
y0i0,
GM
Mh
Observation model
Cloud resolving model
A “Filter” Sequentially Assimilates Data
as it Becomes Available in each Cycle
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200854
Cycle Physics “Barriers”What Can Overcome the Barrier?1. Linear Physics Processes and2. Propagated Forecast Error Covariances
time
forecast
)(ε)(xx
ε)](x[y
x01-i0,i
y0i0,
GM
Mh
Observation model
Cloud resolving model
A “Filter” Sequentially Assimilates Data
as it Becomes Available in each Cycle
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200855
Data AssimilationConclusions Broad, Dynamic,
Evolving, Foundational Science Field!
Flexible unified frameworks, standards, and funding will improve training and education
Continued need for advanced DA systemsfor research purposes(non-OPS)
Can share OPS framework components,e.g., JCSDA CRTM
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200856
Backup Slides
Or: Why Observationalists and Modelers see things differently…
No offense meant for either side
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200857
What Approach Should We Use?
DATA MODEL
CENTRIC CENTRIC
TRUTH
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200858
What Approach Should We Use?
DATA
MODEL
CENTRIC CENTRIC
TRUTH
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200859
My My PrecioPrecious…us…
We Trust the We Trust the Model!Model!
Data hurts Data hurts us!,us!,
Yes…Yes…
What Approach Should We Use?
DATA
MODEL
CENTRIC CENTRIC
TRUTH
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200860
MODEL CENTRIC FOCUS
DATA MODEL
CENTRIC CENTRIC
FOCUS ON“B” Background Error
Improvements are Needed
“xb” Associated background states and “Cycling” are more heavily emphasized
DA method selection tends toward sequential estimators, “filters”, and improved representation of the forecast model error covariances
E.g., Ensemble Kalman Filters,
other Ensemble Filter systems
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200861
What Approach Should We Use?
DATA MODEL
CENTRIC CENTRIC
TRUTH
This is not to say that
all model-centric improvements
are bad…
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200862
What Approach Should We Use?
DATA
MODEL
CENTRIC
CENTRIC
TRUTH
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200863
My Precious…My Precious…
We Trust the Data!We Trust the Data!
Models unfair and Models unfair and hurts us!,hurts us!,
Yes…Yes…
What Approach Should We Use?
DATA
MODEL
CENTRIC
CENTRIC
TRUTH
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200864
DATA CENTRIC FOCUS
DATA DATA
CENTRIC CENTRIC
FOCUS ON“h” Observational Operator
Improvements are Needed
“M0,i(x0)” Model state capabilities and independent experimental validation is more heavily emphasized
DA method selection tends toward “smoothers” (less focus on model cycling), more emphasis on data quantity and improvements in the data operator and understanding of data representation errors e.g., 4DVAR systems
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200865
DUAL-CENTRIC FOCUS
Best of both worlds?
Solution: Ensemble based forecast covariance estimates combined with 4DVAR smoother for research and 4DVAR filter for operations?
Several frameworks to combine the two approaches are in various stages of development now…
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200866
What Have We Learned?
DATA MODEL
CENTRIC CENTRIC
TRUTH
Your Research Objective is CRITICAL to making the right choices…
1. Operational choices may supercede good research objectives
2. Computational speed is always critical for operational purposes
3. Accuracy is critical for research purposes
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200867
What About Model Error Errors?
A Strongly
Constrained
System?
Can Data Over
Constrain?
““I just can’t run I just can’t run like I used to.”like I used to.”
Model“Little Data People”
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200868
What About Model Error Errors?
A Strongly
Constrained
System?
Can Data Over
Constrain?
““We’ll… no one’s We’ll… no one’s perfect.”perfect.”
DA expert
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200869
Optimal Interpolation (OI)
)]([a bb h xyWxx
OI merely means finding the “optimal” Weights, W
Eliassen (1954), Bengtsson et al. (1981), Gandin (1963)
Became the operational scheme in early 1980s and early 1990s
1)( RHBHBHW TT
A better name would have been “statistical interpolation”
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200870
What is a Hessian?
A Rank-2 Square Matrix
Containing the Partial Derivatives of the Jacobian
G(f)ij(x) = DiDj f(x)
The Hessian is used in some minimization methods,e.g., quasi-Newton…
)()( 11 nnnn ff xxGxx
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200871
The Role of the Adjoint, etc.Adjoints are used in the cost function minimization procedure
But first…
Tangent Linear Models are used to approximate the non-linear model behaviors
L x’ = [M(x1) – M(x2)] / L is the linear operator of the perturbation model
M is the non-linear forward model is the perturbation scaling-factorx2 = x1 + x’
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200872
Useful Properties of the Adjoint
<Lx’, Lx’> <LTLx’, x’> LT is the adjoint operator of the perturbation model
Typically the adjoint and the tangent linear operator can be automatically created using automated compilers
y = (x1, …, xn, y)
*xi = *xi + *y /xi
*y = *y /y where *xi and *y are
the “adjoint” variables
DoD CG/AR at Colorado State University AMS Data Assimilation Education Forum January 21, 200873
Useful Properties of the Adjoint
<Lx’, Lx’> <LTLx’, x’> LT is the adjoint operator of the perturbation model
Typically the adjoint and the tangent linear operator can be automatically created using automated compilers
Of course, automated methods fail for complex variable types
(See Jones et al., 2004)
E.g., how can the compiler know when the variable is complex, when codes are decomposed into real and imaginary parts as common practice? (It can’t.)