ENSEMBLE FORECASTING – A NEW PARADIGM IN NUMERICAL WEATHER PREDICTION Zoltan Toth Global Systems Division, ESRL/OAR/NOAA Acknowledgements: Isidora Jankov, Malaquias Pena, Yuanfu Xie, Paula McCaslin, Paul Schultz, Linda Wharton, Roman Krysztofowicz, Yuejian Zhu, Andre Methot, Tom Hamill, Kathy Gilbert, et al.
83
Embed
ENSEMBLE FORECASTING – A NEW PARADIGM IN NUMERICAL WEATHER PREDICTION Zoltan Toth Global Systems Division, ESRL/OAR/NOAA Acknowledgements: Isidora Jankov,
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
ENSEMBLE FORECASTING – A NEW PARADIGM IN
NUMERICAL WEATHER PREDICTION
Zoltan Toth
Global Systems Division, ESRL/OAR/NOAA
Acknowledgements:
Isidora Jankov, Malaquias Pena,Yuanfu Xie, Paula McCaslin, Paul Schultz, Linda Wharton, Roman
Krysztofowicz, Yuejian Zhu, Andre Methot, Tom Hamill, Kathy Gilbert, et al.
INTRODUCTION• Share experience / perspective
– 19 yrs at NCEP/EMC• Operational needs
– Over a year at NOAA research lab (GSD)• Research opportunities
• Review host of NWP research / development issues pursued at NOAA– Some closer, others further from operational use
• Highlight collaborative opportunities– Research thrives on exchange of ideas
OUTLINE / SUMMARY• Why ensembles?
– Knowledge about uncertainty• Complete the forecast with probabilistic information• User needs for covariance / scenarios
• Capturing uncertainty in initial conditions – Focus on dynamical consistency in perturbations– Derive estimate of error variance in best analysis
• New NWP modeling paradigm– Focus on ensemble (not single value) forecasting– Stochastically represent effect of unresolved scales
• Choice of DA scheme– Variational or sequential?– Both can use ensemble-based covariances
• Challenges in data assimilation– Careful attention to many details related to
• Forward operators, control variables, DA/forecast cycle, moist constraints, dynamical consistency (initialization of forecast with imperfect models), etc
– Ensemble-based covariances – “hybrid” mehod?
DATA ASSIMILATION / ENSEMBLE FORECASTING PREAMBLE
• Objective– Probabilistic / ensemble estimate of current /
future state of system• Ensemble of gridded multivariate 3D fields
• Approach– Estimate initial state – Data assimilation
• Represent via ensemble
– Project initial ensemble into future – Ensemble forecasting
• Capture model related uncertainty
NUMERICAL WEATHER PREDICTION (NWP) BASICS
COMPONENTS OF NWP• Create initial condition reflecting state of the atmosphere, land, ocean• Create numerical model of atmosphere, land, ocean
ANALYSIS OF ERRORS• Errors present in both initial conditions and numerical models• Coupled atmosphere / land / ocean dynamical system is chaotic
– Any error amplifies exponentially until nonlinearly saturated– Error behavior is complex & depends on
• Nature of instabilities• Nonlinear saturation
IMPACT ON USERS• Analysis / forecast errors negatively impact users
– Impact is user specific (user cost / loss situation)• Information on expected forecast errors needed for rational decision making
– Spatial/temporal/cross-variable error covariance needed for many real life applications– How can we provide information on expected forecast errors?
WHAT INFORMATION USERS NEED• General characteristics of forecast users
– Each user affected in specific way by• Various weather elements at • Different points in time & • Space
• Requirements for optimal decision making for weather sensitive operation– Probability distributions for single variables
• Lack of information on cross-correlations– Covariances needed across
• Forecast variables, space, and time
• Format of weather forecasts– Joint probability distributions
• Provision of all joint distributions possibly needed by users is intractable– Encapsulate best forecast info into calibrated ensemble members
• Possible weather scenarios – 6-Dimensional Data-Cube (6DDC)
» 3 dimensions for space, 1 each for time, variable, and ensemble members
• Provision of weather information– Ensemble members for sophisticated users
• Other types of format derived from ensemble data– All forecast information fully consistent with calibrated ensemble data
HOW CAN WE REDUCE & ESTIMATE EXPECTED FORECAST ERRORS?
STATISTICAL APPROACH• Statistically assess errors in past unperturbed forecasts (eg, GFS, RUC)
– Can correct for systematic errors in expected value– Can create probabilistic forecast information – Eg, MOS PoP
• Limitation– Case dependent variations in skill not captured– Error covariance information practically not attainable
DYNAMICAL APPROACH – Ensemble forecasting• Sample initial & model error space - Monte Carlo approach
• Pre-location of spare parts & other assets where needed– Reservations
• Rebooking of passengers– Customer service
• Compensation of severely affected customers
• How to design economically most viable operations?– Given goals / requirements / metrics / constraints
SELECTION OF OPTIMAL USER PROCEDURES• Generate ensemble weather scenarios ei, i = 1, n• Assume weather is ei, define optimal operation procedures oi
• Assess cost/loss cij using oi over all weather scenarios ej
• Select oi with minimum expected (mean) cost/loss ci over e1,…en as optimum operation
COST/LOSS cij GIVEN ej
WEATHER & oi OPERATIONS
ENSEMBLE SCENARIOS
e1 e2 . en
o1 c11 c12 . cn c1
o2 c21 c22 . c2n c2
. . . . . .
on cn1 cn2 . cnn cn
OP
ER
AT
ION
PR
OC
ED
UR
ES
EX
PE
CT
ED
CO
ST
DATA ASSIMILATION BASICS• Two distinct objectives
– Reproduce reality as faithfully as possible• May hinder NWP forecast application
– Create initial condition leading to best NWP forecast• Initial state must be consistent with model dynamics
• Technique– Probabilistic / ensemble estimate of current / future state of system
• Ensemble of gridded multivariate 3D fields– Bayesian combination of “prior” & observations
• Must have error estimates for both prior & observations
• Basic functionalities (steps) needed– Relate observations to model variables
• Forward operators– Combine information from various observing systems into “superob”
• For each model variable• Accurate error estimates needed
– Spread effect of observations across time/space/variables• Use dynamical constraints, ensemble-based covariances, etc
– Combine prior and observationally based analysis• Use error estimates
CHOICE OF DA SCHEME• Criteria
– Actual or expected quality of performance• Results / expectations
– Some indications that 4DVAR offers higher quality• Dynamically constraint increments not restricted to ensemble
space• 4Dvar with ensemble-based covariance superior
– Buehner et al – “hybrid” scheme
• How to estimate error variance in 4Dvar analysis?Two approaches• Run ensemble-based DA
– Very expensive– Not 4Dvar, but ensemble-based DA errors are
estimated• Estimates affected by DA methods/assumptions
• Alternative approach– Based on basic assumptions independent of DA
schemes– Under development / testing
ESTIMATING & REPRESENTING INITIAL CONDITION RELATED UNCERTAINTY
• Objective– Make initial perturbations consistent with
uncertainty in analysis
Two approaches available• Variational DA
– Estimate uncertainty in 3/4Dvar analysis– Initialize ensemble with estimated analysis error
variance• Ensemble-based DA
– Use one of several ensemble-based DA schemes
Accurate estimates of analysis error variances is critical for the proper initialization of ensembles. This variance is the initial uncertainty that the ensemble perturbations try to mimic. Because of large computational costs, not all DA schemes explicitly compute analysis error variances (e.g., GSI does not). Furthermore, estimates of analysis errors derived via DA schemes are influenced by the assumptions used to create the analysis fields, resulting in a scheme-dependent analysis error. For example, in regions where observations are scarce or the DA scheme gives low weights to them, the analysis errors will be highly correlated with the first guess errors, making it difficult to estimate the true errors (Simmons and Hollinsworth, 2002). A methodology for the estimation of analysis and forecast errors is introduced here. The method is based on a few simple assumptions that are independent of any data assimilation method and provides error estimates with a range of uncertainty.
A schematic showing an EDAS scheme where the ensemble (based on ET) produce a flow-dependent error covariance matrix that is combined with the static covariance matrix generated by a variational DA (GSI) and, in turn the analysis error-variance is used to initialize the Ensemble forecast scheme. The method introduced in this paper allows a scheme-independent estimation of error covariances to initialize the ensemble.
Estimating analysis error variances for ensemble initializationMalaquías Peña1, Isidora Jankov2 and Zoltan Toth3
1IMSG at EMC/NCEP/NOAA, 2CIRA at GSD, 3GSD/ESRL/NOAA
NOAA Earth SystemResearch Laboratory
))((2)()()( 2222 TATFTATFAFd
where d is the perceived root mean square error, F is the forecast, A is the
analysis, T is the true state and ρ is the correlation between true forecast error
and true analysis error.
Defining the true analysis error-variance: f02 ≡ (A-T)2
and the true forecast error-variance: flead2 ≡ (F-T)2,
the perceived forecast error variances measured on each lead time can be estimated via the following set of equations:
2. ConceptThe perceived error (forecast minus analysis at the verifying time) variance, is decomposed into the true analysis error variance and the true forecast error variance:
The following assumptions are used to simplify the problem:
1. Small initial errors grow exponentially and saturate following a logistic function. Therefore, the evolution of errors can be parameterized with a minimal number of parameters that can be obtained via observations of perceived errors. Departures from this evolution of errors will be attributed to model errors, which will be modeled with a continuous function.
2. At short lead times, errors are local. That is, we ignore advection of errors from neighboring gridpoints. At long lead times, errors at any gridpoint results from the influence of all surrounding gridpoints.
3. The correlation between analysis errors and forecast errors decreases on each analysis cycle at a power rate:
1. Introduction
The goal is to estimate the true analysis and forecast error-variance consistent with the perceived error variance observed on each lead time. This can be expressed as a minimization of the following J:
0612
02
626 2ˆ ffffd hhh
01222
02
122
12 2ˆ ffffd hhh
:
...,18,12,6),ˆmax( 122 hhhiwddJ iii }where the estimated perceived error variance,
2ˆid is given in (1).
3. Assumptions
ρm = (ρ1)m , m=2,..Mwhere ρ1 is the correlation at 6h lead time, ρ2 =(ρ1)2 is the correlation at 12h lead time, ρ3 = ρ1 ρ2 , is the correlation at 18h, etc. Only one parameter (ρ1) needs to be determined.
4. Application to Ensemble Data Assimilation Systems
5. Tests of the analysis error estimation method
A schematic showing an Ensemble Data Assimilation scheme where two assimilation schemes, one flow-dependent and the other static, are run and combined to produce a hybrid analysis. In this scheme, the analysis error estimation (derived from the EnKF scheme) is fed into the ensemble generation scheme but does not reflect the analysis error from the GSI.
Lagarias, J.L., J. A. Reeds, M.H. Wrights and P.E. Wright, 1998: Convergence properties of the Nelder-Mead Simplex Method in Low dimensions, SIAM J. Optim., 9, 112-147Lorenz, E. N., 1963: Deterministic nonperiodic flow. J. Atmos. Sci. 20: 130–141. Simmons, A. J. and A. Hollingsworth, 2002: Some aspects of the improvement in skill of numerical weather prediction. Quart. J. Roy. Meteor. Soc., 128:647–677.
Top panel. In blue: True Error variance; in red: Perceived Error variance; in green: the modeled Perceived Error variance. Bottom panel. In black: Anomaly correlation of analysis errors and forecast errors at different leads as generated from the model; in magenta: correlation using (2).GFS 500hPa total energy error at two gridpoints
(2)
SIMULATED FORECASTS WITH THE LORENZ 40 VARS
References
We apply the method to estimate analysis and forecast errors in the 3-variables Lorenz model (Lorenz 1963) under a perfect model scenario. Synthetic data is generated from the control run (nature) plus a random value. A 3DVar scheme is used to assimilate the data.
Left: Estimate of true forecast error variance (f, blue)
and the analysis error (f0) as estimated by the method,
and the fitting curve (dhat) on the perceived error (d). Right panels: The diagnosed correlation and the fitting error.Note that the optimization procedure (the simplex method; Lagrarias et al., 1998) produces a very good fit with the observed perceived errors.
Point in the Extratropics
Point in the Tropics Same as above but for a point in the tropics. Compare the true forecast error variances on left panels and note that the true errors are underestimated in the tropics. Right panels: Compare the correlation function and note that analysis and forecasts are much more correlated in the tropics than in the extratropics
(1)
wi is a weighting function to ensure that the fitting is best at the initial time, where we have most confidence of the measurements.
PROPOSED HYBRID EDAS
GSI-ENKF HYBRID EDAS
A schematic showing an EDAS scheme where the ensemble (based on ET) produce a flow-dependent error covariance matrix that is combined with the static covariance matrix generated by a variational DA (GSI) and, in turn the analysis error-variance is used to initialize the Ensemble forecast scheme. The method introduced in this paper allows a scheme-independent estimation of error covariances to initialize the ensemble.
A schematic showing an Ensemble Data Assimilation scheme where two assimilation schemes, one flow-dependent and the other static, are run and combined to produce a hybrid analysis. In this scheme, the analysis error estimation (derived from the EnKF scheme) is fed into the ensemble generation scheme but does not reflect the analysis error from the GSI.
• If cycled => Suboptimal covariance & forecast performance
• Alternatives tested: – Noise added to B to reduce rank deficiency is NOT cycled
• Two sets of first guesses – effective but expensive
– Ensemble Transform with Regularization of B (ETR) • Affects mainly the variance structure of Pa
– Minimal effect on covariance
• Superior performance with large ensemble– Effect is expected to be less with realistically small ensemble
• Particle Data Assimilation Coming …
WHAT WE LEARNT
ENSEMBLE INITIALIZATION – NEW APPROACHES
• Can rescaling method be revised?– To mimic background error reducing effect of DA?
• Work in progress• Still Gaussian / linear approach
• Quest for non-Gaussian ensemble formation– Need for highly nonlinear situations
• E.g, convective processes, Tropical Cyclone development– Positive impact expected both for
• Ensemble performance• Data assimilation (cleaner covariances)
MODEL RELATED UNCERTAINTIES
• Origin– Due to truncation / approximations
• Finite spatial resolution• Finite time steps• Approximations in physics
• Forecast impact– Random errors added at each time step
• Can be considered only stochastically
• Design of current generation models– Aimed at making best single (unperturbed) forecast
• Minimize RMS error– Intentionally ignores effect of unresolved processes
• Fine spatial scales• Fine time scales• Full physics
– No stochastic representation of unresolved processes
NEW NWP MODELING PARADIGM
• Ensemble application– Effect of unresolved processes must be represented
• Otherwise ensemble cloud misses reality
• Approach– Stochastically simulate (generate) variance equal to
error associated with each process truncated /approximated in model
• Major change in modeling approach– Focus on / test in ensemble
• Instead of single unperturbed forecast– Build stochastic element to represent model related
errors into each model component• Major effort needed
CURRENT METHODS TO REPRESENTMODEL RELATED ERRORS
• Multi-model/physics (Houtekamer et al)– Ad hoc and pragmatic approach– Unique & distinct solutions
• Cannot provide continuum of realizations– No scientific foundation
• Giving up ideal of capturing nature in cognizant manner
• Stochastic perturbations added (Buizza et al)– Formal (not fully informed) response to need– Random noise has very limited effect
• Structured noise used at NCEP (Hou et al)
• Stochastic physics (Teixeira et al)– Right approach?– Capture/simulate, not suppress effect of unresolved
processes
REPRESENTING MODEL RELATED UNCERTAINTY:
A STOCHASTIC PERTURBATION (SP) SCHEMEGeneral Approach: Add a stochastic forcing term into the tendencies of the model eqsStrategy: Generate the S terms from (random)
linear combinations of the conventional
perturbation tendencies.
Desired Properties of Forcing1. Applied to all variables2. Approximately balanced 3. Smoothly varying in space and time4. Flow dependent5. Quasi-orthogonal
FINE SCALE ENSEMBLE EXPERIMENTSTO CAPTURE MODEL RELATED UNCERTAINTY
• Recognize importance of microphysics for moist processes
• Capture model related forecast uncertainty
• Hydrometeorological Testbed (HMT) ensemble
• Two model cores, various microphysics schemes
Ensemble Prediction System Development for Aviation and other Applications
Isidora Jankov1, Steve Albers1, Huiling Yuan3, Linda Wharton2, Zoltan Toth2, Tim
Schneider4, Allen White4 and Marty Ralph4
1Cooperative Institute for Research in the Atmosphere (CIRA),Colorado State University, Fort Collins, CO
Affiliated with NOAA/ESRL/ Global Systems Division
2NOAA/ESRL/Global Systems Division
3Cooperative Institute for Research in Environmental Sciences (CIRES)University of Colorado, Boulder, CO
Affiliated with NOAA/ESRL/Global Systems Division
4 NOAA/ESRL/Physical Sciences Division
BACKGROUND
• Objective• Develop fine scale ensemble forecast system
• Application areas• Aviation (SF airport)• Winter precipitation (CA & OR coasts)• Summer fire weather (CA)
• Potential user groups• Aviation industry, transportation, emergency and
ecosystem management, etc
EXPERIMENTAL DESIGN 2009-2010
Nested domain: • Outer/inner nest grid spacing 9 and 3 km, respectively.• 6-h cycles, 120hr forecasts foe the outer nest and 12hr forecasts for the inner nest • 9 members (listed in the following slide)• Mixed models, physics & perturbed boundary conditions from NCEP Global Ensemble
• 2010-2011 season everything stays the same except initial condition perturbations?
48-hr forecast starting at 12 UTC, 18 January 2010
Reliability of 24-h PQPF
34 34http://esrl.noaa.gov/gsd/fab
OAR/ESRL/GSD/Forecast Applications Branch
Reliability diagrams of 24-h PQPF 9-km resolution Dec 2009 - Apr 2010
Observed frequency vs forecast probabilityOverforecast of PQPFSimilar performance for different lead times
Brier skill score (BSS):Reference brier score is Stage IV sample climatologyBSS is only skilful for 24-h lead time at all thresholds and for 0.01 inch/24-h beyond 24-h lead time.
CYCLING FINE SCALE PERTURBATIONS
• How to create dynamically conditioned fine scale perturbations consistent with forcing from global ensemble?
• Current approach• Interpolate global perturbed initial conditions• Fine scale motions missing initially
• Need to spin up• UKMet, Canadian, part of NCEP SREF etc ensembles
• Cycle LAM perturbations
Initial Perturbations for HMT-10/11“Cycling” GEFS (or SREF) perturbations
36
00Z 06Z
Global Model Analysis interpolated on LAM grid
LAM forecast driven by global analysis
Forecast Time12Z
Perturbations
37
Cloud CoverageJuly 30 2010 00UTC
00hr
03hr
06hr
LAPS CYC NOCYC
ON THE HORIZON:COUPLED DA – ENSEMBLE SYSTEMS
• Analysis / forecast error estimation independent of DA schemes• Poster results
• Rescaling of global perturbations consistent with DA
• Nonlinear / non-Gaussian initial perturbations via Bayesian particle filters• Coupled with 4-DVAR
• Bayesian particle filter for nonlinear / non-Gaussian DA / ensemble forecast system
CHALLENGES IN DA - RELATE OBSERVATIONS TO MODEL VARIABLES
• Critical step in relating reality to NWP model• Wide range of observing systems /
• Background must contain all information available• Prior to latest observations
• Ideally a short range NWP forecast from latest analysis• As used in global DA
• Only limited success with Limited Area Model (LAM) applications• Full LAM DA/forecast cycling attempts fail• Noise around boundary conditions amplify via cycling?
• Current approach• Periodically cold-start LAM analysis cycle from global analysis
• E.g., NAM, RUC at NCEP• Scientifically unsatisfactory, suboptimal performance
• Promising experiments at GSD• Use lateral boundary as constraint in LAM analysis• Bring in dynamically consistent fine scale info from LAM
background
Cycling Impact on STMAS analysis
Without Withcycling
STMAS-WRF ARW cycling Impact
OAR/ESRL/GSD/Forecast Applications Branch
CHALLENGES IN DAENSURE CONSISTENCY WITH MODEL DYNAMICS 1
• Dynamically inconsistent information is lost - insult• Quick transitional process introduces additional
errors – injury
• Possible approaches• Balance constraints – widely used• Digital filter – E.g., Rapid Refresh cycle at GSD• 4-DVAR – used in global DA• Additional constraints
• Local Analysis and Prediction System (LAPS) “Hot Start”• “Conceptual” relationships among moist & other variables
Local Analysis and Prediction System (LAPS)
Steve Albers, Dan Birkenheuer, Isidora Jankov
Paul Schultz, Zoltan Toth, Yuanfu Xie, Linda Wharton
CHALLENGES IN DAENSURE CONSISTENCY WITH MODEL DYNAMICS 2
• Choice of control variable affects increments
• Integrated variables like streamfunction & velocity potential poor choice• To preserve integrated quantities, analysis must
introduce spurious fine scale fluctuations to compensate for observation related changes
• Use vorticity and divergence (or u,v)
Control Variable Issues
Xie et al 2002 has studied the analysis impact by selecting different control variables for wind, ψ-χ, u-v, or ζ-δ VARS.
Conclusions:• ζ-δ is preferable (long waves on OBS; short on
background);• u-v is neutral; • ψ-χ is not preferred: short on OBS; long on
background;
OAR/ESRL/GSD/Forecast Applications Branch
Non-physical error by ψ-χ VARS
• Ψ and χ are some
type of integral of
u-v. A correction to
the background toward the obs changes the integral of u. The background term adds opposite increment to the analysis for keeping the same integral as the background.
OAR/ESRL/GSD/Forecast Applications Branch
Responses of three type of VARS
For a single obs of u, e.g., ψ-χ, u-v, or ζ-δ VARS have different responses as shown here. A ψ-χ VAR may not show this clearly as many filters applied.
OAR/ESRL/GSD/Forecast Applications Branch
CHALLENGES IN DAENSURE CONSISTENCY WITH MODEL DYNAMICS 2
• Background error covariance info used to spread observational info across space/time/variables
• Problems with estimation & use of covariance abound• Poor estimates from small sample of lagged or
ensemble forecasts• Long distance correlation estimates on small scales can
be spurious
• Possible solution• Multiscale analysis by Yuanfu Xie et al• Space-Time Multiscale Analysis System (STMAS)
Space and Time Multiscale Analysis System (STMAS)
Yuanfu Xie, Steve Albers, Brad Beechler, Huiling Yuan
Contributors:
GSD: Dan Birkenheuer, Zoltan Toth, Steve Koch
UC Boulder: Tomoko Koyama
National Marine Data and Information Service (NMDIS): Wei Li, Zhongjie He, Dong Li
Central Weather Bureau (CWB): Wenho Wang, Jenny Hui
NCAR: Paul Beringer
Collaborators:
MIT/Lincoln Lab, CWB, NMDIS, NOAA/MDL and NOAA/ARL
• Tornado touched down at Windsor, Colorado around 17:40 UTC, 22 May 2008
• STMAS initialization 1.67 km 301 x 313 background model: RUC 13km, 17 UTC hot start (cloud analysis)• Boundary conditions: RUC 13km, 3-h RUC forecast (initialized at 15 UTC)• WRF-ARW 1.67 km, 1-h forecast Thompson microphysics
• Postprocessing: Reflectivity
00-01hr reflectivity cross-section initialized at 17 UTC 22 May 2005, mosiac radar vs. WRF forecast (STMAS)
00-01hr reflectivity cross-section initialized at 17 UTC 22 May 2005, WRF forecast, RUC vs. STMAS
CHALLENGES IN DAENSURE CONSISTENCY WITH MODEL DYNAMICS 3
• What is best approach to initialize imperfect models?• Systematic difference between reality and its model
• Traditional approach• Remove systematic model error from background
forecast
• Introduce new paradigm• “Map” observations onto model attractor• Assimilate data in model attractor space• Run numerical forecast• “Remap” forecast state back to space of reality
65
FORECAST INITIALIZATION FOR IMPERFECT NUMERICAL MODELS
Environmental Modeling Center
NOAA/NWS/NCEP USA
1SAIC at Environmental Modeling Center, NCEP/NWS
Acknowledgements: Dusanka Zupanski, Guocheng Yuan
http://wwwt.emc.ncep.noaa.gov/gmb/ens/index.html
Zoltan Toth
and
Malaquias Pena Mendez1
66
• CHAOTIC ERRORS– Statistical approach
• “NMC” method (differences between past short-range forecasts verifying at same time)
• Ensemble method (differences between past ensemble forecasts verifying at same time)
– Inflation of background errors (ie, move analysis closer to observations)» Multiply background error covariance matrix in 3/4DVAR» Increase initial perturbation size in ensemble-based DA
• Assume model-related errors are stochastic with characteristics different from chaotic errors (D. Zupanski et al)
– Introduce additional (model) error covariance term (allow analysis to move closer to obs.)
– How statistics determined?
• Assume errors are systematic (Dee et al)
– Estimate systematic difference between analysis and background
– Before their use, move background by systematic difference closer to analysis
HOW TO TREAT FORECAST ERRORS IN DA
Move background toward nature
Move background toward nature
IS THIS THE RIGHT MOVE?Treat initial and model error the same way?
67
HOW TO USE MAPPING IN DA / NWP FORECASTING?
1. Estimate the mapping between nature and the model attractors
2. Map the observed state of nature into the space of the model attractor– Move obs. with mapping
vector
3. Analyze data4. Run the model from the
mapped initial condition5. “Remap” the analysis and
forecast back to the phase space of nature
Standard procedure
New
ste
pN
ew s
tep
Challenging step
68
MAPPING VECTOR
• Definition
– Vector that provides best remapped forecast performance
• Estimation
1. Difference between long term time means of forecast trajectory & nature – Evaluate at observation sites (H)
In practice, nature is not known• Use traditional analyses as proxy:
2. Adaptive technique• If systematic errors are regime dependent, or climate means are
not available• Details later
69
MODEL & DA DETAILS• Lorenz (1963) 3-variable model:
• Satellite and GPS data assimilation• Chinese and Japanese Geostationary satellite
• Low level wind and solar radiation
OUTLINE / SUMMARY• Why ensembles?
– Knowledge about uncertainty• Complete the forecast with probabilistic information• User needs for covariance / scenarios
• Capturing uncertainty in initial conditions – Focus on dynamical consistency in perturbations– Derive estimate of error variance in best analysis
• New NWP modeling paradigm– Focus on ensemble (not single value) forecasting– Stochastically represent effect of unresolved scales
• Choice of DA scheme– Variational or sequential?– Both can use ensemble-based covariances
• Challenges in data assimilation– Careful attention to many details related to
• Forward operators, control variables, DA/forecast cycle, moist constraints, dynamical consistency (initialization of forecast with imperfect models), etc
– Ensemble-based covariances – “hybrid” mehod?
BACKGROUND
Noise in EnKF
Noise added
Forecast (Xb noisy)
Xa
Xa noisy Bnoisy
DA(xa)
Design 1. Cycles Xb noisy . Typical application
Design 2. Cycles Xb clean . New design
Noise added
Forecast (Xb noisy)
Xa noisy
Xa noisy
noise
Forecast (Xb noisy)
Xa
Xa noisy Bnoisy
Forecast (Xb clean)Bclean
Pa clean
DA(xa)
noise
Forecast (Xb noisy)
Xa clean
Xa noisy
Forecast (Xb clean)
Pa noisy
Too expensive to run in an operational environment
Noise in EnKF (Cont.)
Forecast (Xb)Xa B+Binvertible
DA(xa)
Design 3. Correlated noise in B
Forecast (Xb)Xa
Examples for Design 3:
A) Hybrid ensemble Kalman Filter (Hamill and Snyder, 2000)
• Bhybrid =(1-α)BEnKF+αB3DVar
• α is a weighting parameter between 0 (only flow-dependent) and 1 (only 3DVar)
• Variance and covariance of B equally affected
B) New approach: Regularization (Tikonov, 1943) procedure for B
• Brid = BEnKF+ λDiag(BEnKF)
• λ is a ridging parameter
• Variance is affected primarily (that is later scaled back)
• Covariance only minimally affected
Pa
ET with Regularization (ETR)• ET is an ensemble perturbation method (produces Xa (=XfT) )• Since B can be formed from Xf, and R-1 is known, the analysis
precision relationship:
can be added to ET for DA applications• When B-1 exists, ET plus the analysis precision relationship is
equivalent to ETKF (Bishop et al 2001)• By construction, however, ET produces a B that is rank
deficient, thus B-1 does not exist (ETKF framework does not constrain B to be invertible)
• A regularization procedure is implemented to reduce the rank deficiency problem in ET, thus ETR
111 RBA
Experimental Setting
• 3-DVar Minimizing the following cost function
)]()()()[()( HxyRHxyxxBxxxJ oTobb 11
2
1
• Lorenz 96 Model (with ’07 pars)
where F= 5.1 and m=1,..,21
• Perfect model scenario. One observation per grid-point. 6-hr assimilation cycle• Uncorrelated observational error: normally distributed random noise with unit variance (R=I)
• Ensemble Transform (perturbation method)
• B is computed using NMC method
TXX fa xxxxxxX
K211K
1,...,,where
fTf A XX 1)(and T is a function of eigenvectors and eigenvalues of
RMS analysis error
ETKF with cycling random noise
ETR with minimal regularization (.001)
ETR with scaled analysis amplitude
Evolution of analysis error in three gridpoints (different color lines) for ETKF and ETR; same start.
ETKF no cycling random noise
Time (days)
In this situation, ETR ≈ ET
Mae=0.42Nearly
identical
Mae=0.21
Mae=0.42
Mae=0.36
Design 1
Design 2 Design 3
Cycling noise
Snapshot of B and B-1
Well-sampled, stable, fixed (no “errors of the day”)
ETKF, no inflation
Noisy covariance due to sampling errors. Spurious long-distance correlations. Matrix ill-conditioned.Filter diverges
3DVAR – NMC method
After 4 cycles
3DVar - NMC method
ETKF with random perturbations added to each member before performing time integration. Inverse becomes stable Prevents filter divergence; However, noise cycled
ETKF, no inflation
Design 1
Two ensembles run with ETKF, one with (used for estimating B, then discarded), another without addition of noise (used for cycling covariance); Noise still impacts initial perturbations