Top Banner
Progress Closure Quantification of Uncertainty in Extreme Scale Computations www.quest-scidac.org Habib N. Najm [email protected] Sandia National Laboratories Livermore, CA And the QUEST team at large 2015 SciDAC-3 PI Meeting July 22 – 24, 2015 Washington, DC SNL Najm QUEST 1 / 32
32

Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Mar 31, 2018

Download

Documents

dokhanh
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure

Quantification of Uncertaintyin Extreme Scale Computations

www.quest-scidac.org

Habib N. Najm

[email protected] National Laboratories

Livermore, CA

And the QUEST team at large

2015 SciDAC-3 PI MeetingJuly 22 – 24, 2015Washington, DC

SNL Najm QUEST 1 / 32

Page 2: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure

Acknowledgement

QUEST Team:

SNL M. Eldred, B. Debusschere, J. Jakeman,K. Chowdhary, C. Safta, K. Sargsyan, P. Rai

USC R. Ghanem

Duke O. Knio, O. Le Maıtre, J. Winokur, G. Li

UT O. Ghattas, R. Moser, C. Simmons, A. Alexanderian

LANL J. Gattiker, D. Higdon, E. Lawrence, S. Bhat

MIT Y. Marzouk, D. Bigoni, T. Cui, M. Parno

This work was supported by:

US Department of Energy (DOE), Office of Advanced Scientific ComputingResearch (ASCR), Scientific Discovery through Advanced Computing (SciDAC)

Sandia National Laboratories is a multiprogram laboratory operated by Sandia Corporation, a Lockheed MartinCompany, for the United States Department of Energy under contract DE-AC04-94-AL85000.

SNL Najm QUEST 2 / 32

Page 3: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure

Outline

1 Progress Highlights

High DimensionalityLocal KLEBasis AdaptationLow Rank Sparse Tensor Representations

Model ComplexityMultifidelity methodsHierarchical CalibrationAdaptive Sparse Quadrature

Statistical InversionOptimal Experimental DesignAdaptive Local Surrogates

Architecture AwarenessCurrent PracticeLooking Ahead

2 Closure

SNL Najm QUEST 3 / 32

Page 4: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

High Dimensionality and UQ

Dimensionality of UQ problem is the number of degrees offreedom required to represent uncertain model inputsand/or parameters

Number of parametersKarhunen-Loeve expansion (KLE) for random fields

Hi-D challenge in UQ: high-dimensional integration

We discuss advances inLocal KLE

Reduced KLE dimensionality for random field in subdomainsBasis adaptation

Isometric transformations to low-dimensional subspacesLow rank sparse tensors

Combinations of low-D integrals

SNL Najm QUEST 4 / 32

Page 5: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Dimensionality Reduction via Local KLESNL, Purdue, ETH, U.Utah Karhunen Loeve Expansion

We wish to solve PDEs such as−∇ · (a(x, ω)∇u(x, ω)) = f(x), x ∈ D,u = g(x) x ∈ ∂D.

Parameterize the random field a(x, ω) using KLE

a(x, ω) ≈ a(x, Z) = µa(x) +

d∑i=1

√λiψi(x)Zi(ω).

Divide D into a set of non-overlapping subdomains D(i), i = 1, . . . ,K

The original problem on the full domain can be solved in eachsubdomain with proper coupling conditions at the interfaces.

The collection of the subdomain solutions is equivalent to that of theoriginal problem in the global domain, i.e.,

u(x, ω) =K∑i=1

u(i)(x, ω)ID(i)(x),

SNL Najm QUEST 5 / 32

Page 6: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Local KLE – EigenstructureWe represent the restriction of theinput process a(x, ω) in thesubdomain D(i) as

a(i)

(x, ω) ≈ µ(i)a (x) +

d(i)∑j=1

√λ(i)j ψ

(i)j (x)Z

(i)j (ω)

The decay rate of the eigenvaluesdepends critically on the relativecorrelation length.

The rel. correl. length on each subdomainis larger than that on the full domain.

Local KL eigenvalues decayfaster

a(i)(x, ω) parameterized w/ asmaller number of randomvariables, thus d(i) d

0 10 20 30 40 50 60 70 80 9010

−16

10−14

10−12

10−10

10−8

10−6

10−4

10−2

100

102

104

Global Eigenvalues

Local Eigenvalues on 8 × 8 subdomains

Local Eigenvalues on 16 × 16 subdomains

Local Eigenvalues on 32 × 32 subdomains

Decay of eigenvalue magnitudes

SNL Najm QUEST 6 / 32

Page 7: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Local KLE – Algorithm

Off-line phase

In parallel Build (independently)PCE surrogate on eachsubdomain D(i)

For linear PDE each randomrealization requires ndof:∂D(i) + 1

On-line phase

Generate a realization of theradom field on the global domain

Project global field onto eachsubdomain to obtain parametersZ(i) of local KLE a(i)(x, Z(i))

Evaluate local PCE at localrandom parameters Z(i)

Generate solution on eachsubdomain by solving globalinterface problem Error vs. # sub-domain solves

SNL Najm QUEST 7 / 32

Page 8: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Basis Adaptation to Quantities of Interest – USC

QoIs in hi-D systems are frequently low-dimensional

We developed a procedure for basis discovery/adaptationpermits efficient and accurate approximation within alow-dimensional subspace in which the QoI is concentrated

Using Gaussian parameterization of the uncertain inputs,Isometry is first applied to induce a desired structure in therepresentation of the QoI

1st-order terms in one dimensiondiag. quadratic form w/ 2nd-order terms; match target CDF

Reduction is then achieved through projection of theresulting representation

Reduced model captures:the probabilistic content of the QoIits functional dependence on the original parameterization

SNL Najm QUEST 8 / 32

Page 9: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Basis Adaptation Demo – Convective Heat Transfer

Consider a 2D domain with flow past random heatedinclusions, described in high stochastic dimension

Flow past thermal inclusions. The rods have spatially varying thermal conductivities.

An upscaled effective stochastic porous medium iscomputed. The QoI at every spatial point is thehomogenized permeability and conductivity.

permeability and conductivity are statistically dependent.can be evaluated as functions of the fine scale randomness.

SNL Najm QUEST 9 / 32

Page 10: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Basis Adaptation to Quantities of Interest – Demo

Upscaled stochastic permeabilityverified at one spatial point.

Upscaled stochastic conductivityverified at the same spatial point.

Basis adaptation makes it feasible to evaluate upscaledproperties at each spatial location as function of fine scaleuncertainty.Solve a number of low-stochastic-dimension UQ problemsinstead of one high-D problem

SNL Najm QUEST 10 / 32

Page 11: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Fast Evaluation of MP Integrals (FEMPI) — Quantum ChemistrySNL, UIUC ASCR-BES Partnership

Computational Challenge:

Accurate computational prediction of key molecularproperties requires ab initio all-electron theories.Initial focus on vibrational and electronic structure integralsIntegrands involve series of tensor contractions and densematrix manipulations — Nonscalable!Better scaling achieved via enhancements of Monte-Carlo.

QUEST:Improve integration efficiency and scalability

Advanced hi-D function representations used in UQLow rank sparse tensor representations

Replace hi-D integral with a number of low-D integralsEvaluate using sparse quadrature

SNL Najm QUEST 11 / 32

Page 12: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Vibrational Energy integral – Water Molecule – XVH2Main approach: low-rank approximation of integrand:

f(x) ≈R∑i=1

m∏k=1

fki (xk), xk ∈ x = (x1, . . . , xd), ∪mk=1xk = x

High-dimensional integral is estimated via several low-d integrals∫Ωx

f(x)dxlow-rank≈

R∑i=1

m∏k=1

∫Ωxk

fki (xk)dxkquad.≈

R∑i=1

m∏k=1

Q∑q=1

wqfki (xqk)

E.g., second-order correction to zero-point energy (6D):

E(2)0 =

∫e−||ω

Tx||2∆V (x)H(x,x′)e−||ωTx′||2∆V (x′)dxdx′

103 104 105

Sample Size, N

10-3

10-2

10-1

100

Relative error in E

(2)

0

Low-Rank HG Quad.

Monte-Carlo

Random-Walk MC

Ongoing work:

Singular integrals from MP2 theory.Exponential sum apprx + low-rank.

Automatic detection of groupings xkand sparsity within them.

Scale up to larger systems.

SNL Najm QUEST 12 / 32

Page 13: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Model Complexity

Large-scale models require substantial computationalresources for solving the original deterministic problem

This can lead to infeasible costs for either intrusive ornon-intrusive/sampling-based UQ methods

We discuss advances inMultifidelity UQ methods

Use of predictions from models at different levels of fidelityHierarchical calibration and model discrepancy

Use of model bias and discrepancy in statistical calibrationAdaptive sparse quadrature

Selection of computational samples for hi-D integration

SNL Najm QUEST 13 / 32

Page 14: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Multiple Model Forms in UQ – SNL

Given a clear hierarchy of fidelity:Multifidelity forward UQ:

UQ for hi-fi model leveragingcheaper low-fi models

Multifidelity inference:Estimation of low-fi modeldiscrepancies

Given a non-hierarchical ensembleof credible models:

Model probability – prior infoBayesian model selectionModel averaging

Both hierarchy and peersLeverage model selection andmultifidelity inference

SNL Najm QUEST 14 / 32

Page 15: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Multifidelity UQ Using Stochastic Expansions

High-fidelity simulationscan be prohibitive for usein UQ

Low fidelity “design” codesoften exist that arepredictive of basic trends

Leverage LF codes w/ HF UQGlobal approximations of model discrepancyAdaptive sparse grids:

Gen. sparse grids for LF & discrepancy levelsGreedy selection from grids: max ∆QoI/∆CostRefine discrepancy where LF is less predictive

Compressed sensing:Target sparsity within the model discrepancy

SNL Najm QUEST 15 / 32

Page 16: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Hierarchical Calibration & Model Discrepancy – LANL

Hierarchical calibration addresses the relationship betweenmodel discrepancy and parameter bias

Given different calibration examples with different bias

yi = η(x, θ + bi) + δi(x, θ + bi) + εi

e.g. climate model bias different at high vs. low latitudes

Employ a hierarchical model, reconciling the evidence of biasInferred discrepancy effects are better fit to problemsDiagn. of relationship bet. parameter bias & discrepancyAdditional source of uncertainty identifiable in UQ analysesCapability has been in GPMSA, developing the clarifyingexamples and framework of diagnostics for user adoption

SNL Najm QUEST 16 / 32

Page 17: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Hierarchical Calibration Demo – Southern Ocean

Idealized southern ocean modelwith two parameters

Calibration w.r.t. higher fidelityParallel Ocean Program (POP)computations

LANL+NCAR

A number of metricsTemperature, salinity, densityvs. depthVertical heat & salt transport

Hierarchical distributioncombines information fromdifferent metrics

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5x 107

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

SNL Najm QUEST 17 / 32

Page 18: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Adaptive Sparse Quadrature (ASQ) – Duke/MIT

Non-Intrusive Pseudospectral projectionSparse tensorization of 1-D quadrature formulaeReduce number of simulations, improve accuracy

Adaptivity:Progressive construction with cost controlRobust error indicator to guide adaptationNested hierarchical approximationSensitivity-based directional refinement

Application to forward UQ in Gulf of Mexico modelingChallenges with failed computational samples

both ASQ & MC-LHSUse L2-misfit constrained L1-norm minimization (BPDN)

Estimate QoIs: sparse learning from available samples

SNL Najm QUEST 18 / 32

Page 19: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

UQ for Circulation in Gulf of Mexico

Impact of uncertainty in:Initial conditions (4 dims)Wind stress (4-dims)

Time-dependent EOFs

on circulation in Gulf of Mexico

Eigenvalue decay – SSH

mode index

1 2 3 4 5 6 7 8 9 10

log(λi)

100

101

102

103

Mean (left) and STD (right) of sea surface height (SSH) at day 30

SNL Najm QUEST 19 / 32

Page 20: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Statistical Inverse Problems

Statistical inversion is used for data-based estimation ofmodel parameters/inputs with quantified uncertainty

Inverse problems are hard!Typically ill-posed; ill-conditionedHigh-dimensionalityForward model complexity

We discuss select recent advances in Bayesian inversionOptimal experimental design (OED)

Identify optimal sensor placement for geophysical inversionSurrogates & Markov chain Monte Carlo methods

Adaptive local surrogatesParallel MCMC methods

SNL Najm QUEST 20 / 32

Page 21: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Scalable algorithms for optimal exptl design (OED)Large-scale Bayesian inverse problems UT

Context: Inference of parameter fields w/ quantified uncertainty

OED asks the “outer loop” question:How to choose sensor locations so that the inferredparameter field uncertainty is minimized?

In its full generality, this is intractable:Inner problem alone is an infinite dimensional Bayesianinverse problem

Approach:Represent covariance by inverse Hessian of negative logposterior (Laplace approximation)Invoke fast randomized trace estimatorsEmploy techniques from PDE-constrained optimization

Result: OED method whose cost–measured in forward PDEsolves–scales independent of parameter/sensor dimension

SNL Najm QUEST 21 / 32

Page 22: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Formulation of OED for Bayesian inversionHessian/PDE-constrained optimization problem

Seek an experimental design w (e.g., sensor locations) tocollect data d to minimize average posterior variance

OED problem:Minimize average variance given by trace of inverseHessian, evaluated at maximum a posteriori solution ofinverse problem m∗:

minw

Ed

trace

[H−1

(m∗(w),w;d)

] Sample averaging to approximate expectation over d

Randomized trace estimation of H−1

SNL Najm QUEST 22 / 32

Page 23: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

A-optimal sensor placement for inferringlog-permeability in subsurface flow (SPE model)

Posterior variance with various sensor placements

Optimal Sub-optimal Sub-optimal

Inference with the optimal design (parameter dim ∼ 104)

True parameter Posterior mean Posterior sample

SNL Najm QUEST 23 / 32

Page 24: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Asymptotically Exact MCMC MITEarlier times Later times

Inference in computationally intensive models is essentiallyintractable without surrogates

Key questions: Where should a surrogate be accurate? How toconstruct it? Should it depend on the data? How does error inthe surrogate corrupt inference?

Our approach: incremental and asymptotically exactconstruction of posterior-focused model approximations

SNL Najm QUEST 24 / 32

Page 25: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Asymptotically Exact MCMC – Surrogates MIT

Framework includes several different local approximationschemes: linear, quadratic, Gaussian process

Accuracy versus cost (below); orders of magnitude speedups

102 103 104 105

Total number of evaluations

10 2

10 1

100

Rela

tive c

ovari

ance

err

or

True model

Linear

Quadratic

GP

Recent developments:

Surrogates coupled with more sophisticated (gradient andHessian-exploiting) MCMC proposals

Parallel MCMC chains, sharing a common pool of modelevaluations

SNL Najm QUEST 25 / 32

Page 26: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Parallel MCMC with Surrogates MIT

Build a common pool of model runs across parallel workers

Approximation guaranteed to target the correct distribution; useeffective sample size (ESS) to measure efficiency

ESS per CPU-sec usually constant with simple parallel MCMC

Instead, it increases dramatically: chains “borrow strength”

Cost of a computationally intensive contaminant transportinverse problem reduced from 200 hours to 30 minutes

10 1 100 101 102 103

Run time (hours)

10 4

10 3

10 2

10 1

100

(ESS p

er

CPU

)/se

cond True model

AM Quadratic (1)

AM Quadratic (2)

AM Quadratic (5)

AM Quadratic (10)

AM Quadratic (20)

MMALA Quadratic (1)

MMALA Quadratic (10)

SNL Najm QUEST 26 / 32

Page 27: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Architecture Awareness

UQ on massively-parallel heterogeneous architecturesScalability – Load balancing, communication, synchronyMPI, OpenMP, GPU/. . .Memory utilizationParallel I/O – dataFault tolerance and resilience

Consequences:UQ problem formulation, algorithms, software

We discuss our practice & vision in architecture-aware UQUQ librariesNon-intrusive/sampling-based UQIntrusive stochastic-Galerkin UQ

SNL Najm QUEST 27 / 32

Page 28: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Architecture Awareness – Current Practice – SNL

We lower the bar for UQ on advanced architecturesLarge compute ensembles on leadership-class machines

Multilevel optimized partitioning & schedulingRelax reqmt to converge all simulations for all partitions

Fault tolerance, failure mitigation & restart

We ease UQ adoption via usability features/enhancementsLibrary embedding of UQ services in applications

Embed in familiar apps; eliminate custom interface codeSimplify parallel execution; e.g. FELIX/Dakota (PISCEES)

Rapid prototyping and integration with scripting languages

We invest in emerging capabilities, directly or leveragedAdvanced fault tolerance (ASCR UQ)Advanced UQ workflows (QUEST/SUPER, MUQ DAGs)Centralized accessibility (e.g., Github) to maximizecommunity adoption and involvement

SNL Najm QUEST 28 / 32

Page 29: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Forward Vision – Non-intrusive UQSNL

Leverage emerging runtime systems for task-basedparallelism management within QUEST tools

e.g. Legion, Charm++, HPX, UintahMigrate from imperative hybrid-MPI scheduling todeclarative parallelism models

Aggregate UQ and simulation workflow tasks within thesame runtime system, exposing new opportunities forstreamlining, asynchrony, etc.

Move toward loop reordering / embedded ensemblesMappers control task delegation to hybrid hardware

– CPU, GPU, MIC

SNL Najm QUEST 29 / 32

Page 30: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Forward Vision – Intrusive UQSNL

Identify application partners for intrusive UQ methodsAdditional dedicated investments for selected applicationsStochastic Galerkin methodsHybrid Galerkin-Collocation methods

Different levels and types of intrusion, in terms ofSoftware (library linking)Coupling strategy (multiphysics/multiscale UQ)Parallel task scheduling (aggregation of runtime workflows)The actual simulation/solver

Available intrusive and linear algebra libraries in TrilinosStokhos: stochastic Galerkin systemsTpetra: serial and distributed parallel linear algebraKokkos: manycore performance portability

SNL Najm QUEST 30 / 32

Page 31: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure HiD Complexity Inverse Arch

Sparse Linear Stochastic Galerkin Solvers – SNL

Explore algorithmic constructions that show potential to keepfuture hierarchy of HPC cores busy

Additive-multigrid/multilevel (physical/stochastic)Phys. mesh coarse/fine on communication/compute coresDecoupled stochastic prolongation/restriction operatorsHigher-order stoch. levels =⇒ compute intensive cores

Recursive hierarchical matrix preconditioned inversionBreak up matrix hierarchically into smaller nested blocksEach of which can be solved more easily and independently

Hybrid stochastic Galerkin/collocation approachesCoupled intrusive/non-intrusive strategyTarget optimal use of computational architectureTradeoff solution samples of deterministic problem forreduced-size/better-conditioned stochastic Galerkin system

SNL Najm QUEST 31 / 32

Page 32: Quantification of Uncertainty in Extreme Scale ... · PDF fileuanti cation of Uncertainty in Extreme Scale Computations ... C. Safta, K. Sargsyan, P. Rai ... Fast Evaluation of MP

Progress Closure

Closure

Presented select highlights of recent progressHigh dimensionalityModel complexityStatistical inversionArchitecture

We continue toRefine and robustify QUEST algorithms and software toaddress UQ challenges in large-scale problemsAddress UQ needs of SciDAC application partnerships

quest-scidac.org

SNL Najm QUEST 32 / 32