Fill Reduction Algorithm Using Diagonal Fill Reduction Algorithm Using Diagonal Markowitz Scheme with Local Symmetrization Markowitz Scheme with Local Symmetrization Patrick Amestoy ENSEEIHT-IRIT, France Xiaoye S. Li Esmond Ng Lawrence Berkeley National Laboratory
21
Embed
Fill Reduction Algorithm Using Diagonal Markowitz Scheme with Local Symmetrization Patrick Amestoy ENSEEIHT-IRIT, France Xiaoye S. Li Esmond Ng Lawrence.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Fill Reduction Algorithm Using Diagonal Fill Reduction Algorithm Using Diagonal Markowitz Scheme with Local Markowitz Scheme with Local
Symmetrization Symmetrization
Patrick Amestoy
ENSEEIHT-IRIT, France
Xiaoye S. Li
Esmond Ng
Lawrence Berkeley National Laboratory
SIAM CSE03, Feb 10-13, 2003 2
ContentsContents
Motivation
Graph models for Gaussian elimination
Minimum priority metrics
Experimental results
Summary
Add: Runtime and space complexity
SIAM CSE03, Feb 10-13, 2003 3
Motivation -- New Sparse LU Factorization AlgorithmsMotivation -- New Sparse LU Factorization Algorithms
Inexpensive pre/post-processing Equilibration (or scaling) Pre-permute rows or columns of A to maximize its diagonal
Find a matching with maximum weight for bipartite graph of A Example: MC64 [Duff/Koster ‘99]
Iterative refinement
GESP (static pivoting) [Li/Demmel ‘98, SuperLU_DIST] Pivots are chosen from the diagonal Allow half-precision perturbation of small diagonals
Unsymmetrized multifrontal [Amestoy/Puglisi ‘00, MA41_NEW] Prefer diagonal pivoting, but threshold pivoting is possible Allow unsymmetric fronts, but dependency graph is still a tree
Diagonal is (almost) goodStruct(L’) Struct(U)
SIAM CSE03, Feb 10-13, 2003 4
Existing Ordering Strategies to Preserve SparsityExisting Ordering Strategies to Preserve Sparsity
Symmetric ordering algorithms on A’+A Greedy algorithms
•Bipartite graph•After a vertex is eliminated, all the row & column vertices adjacent to it become fully connected – “bi-clique” (assuming diagonal pivot)•The edges of the bi-clique are the potential fills (upper bound !)
Unsym. Long search path In-place [Pagallo/Maulino ‘83]
Local Sym. Path length 2 In-place
Elimination models can be implemented using standard graphs or quotient graphs, with different cost in time & space.
SIAM CSE03, Feb 10-13, 2003 13
Minimum Priority MetricsMinimum Priority Metrics
Metrics are based on “approximate degree” in the sense of AMD, can be implemented efficiently
Almost the same cost using various metrics: Based on row & column counts:
PRODUCT (a.k.a. Markowitz), SUM, MIN, MAX, etc.
Minimum fill : areas associated with the existing cliques are deducted …...
SIAM CSE03, Feb 10-13, 2003 14
Preliminary Results with Local SymmetrizationPreliminary Results with Local Symmetrization
Matrices: 98 unsymmetric in structure
Metrics : based on row/column counts or fill
Solvers: MA41_NEW : unsymmetrized multifrontal
Local symmetrization ordering is ideal for this solver SuperLU_DIST : GESP
SIAM CSE03, Feb 10-13, 2003 15
Compare Different MetricsCompare Different Metrics
Solver: MA41_NEWAverage fill ratio using various metrics with respect to Markowitz
(product of row & col counts)
Metrics Mean fill ratio
SUM row & col counts 0.999
MAX row & col counts 6.079
MIN row & col counts 15.94
Approx. min fill (AMF1) 0.965
Approx. min fill (AMF4) 0.959
SIAM CSE03, Feb 10-13, 2003 16
Compare with AMD(A’+A) using Min Fill -- All Compare with AMD(A’+A) using Min Fill -- All UnsymmetricUnsymmetric
MA41_NEW
SuperLU_DIST
Fill ratio Flops ratio
Mean 0.96 0.92
Best / worst 0.41 / 1.27 0.13 / 2.38
Fill ratio Flops ratio
Mean 0.96 0.96
Best / worst 0.38 / 2.36 0.009 / 6.00
SIAM CSE03, Feb 10-13, 2003 17
Compare with AMD(A’+A) using Min Fill -- Very Compare with AMD(A’+A) using Min Fill -- Very UnsymmetricUnsymmetric
MA41_NEW
SuperLU_DIST
Fill Flops
Mean 0.88 0.77
Best / worst 0.38 / 1.18 0.009 / 1.69
Fill Flops
Mean 0.95 0.89
Best / worst 0.41 / 1.27 0.13 / 2.38
SIAM CSE03, Feb 10-13, 2003 18
SummarySummary
First implementation based on BQG model Features: supervariable, element absorption, mass elimination
Using approximate degree (degree upper bound)Tried various metrics on large collection of matrices
PRODUCT, SUM, MIN-FILL, etc. Not a single one is universally best, MIN-FILL is often better
Local symmetrization Cheaper to implement, harder to understand behavior Especially suitable for unsymmetrized multifrontal, also benefit GESP Respectable gain for very unsymmetric matrices
SIAM CSE03, Feb 10-13, 2003 19
Summary (con’d)Summary (con’d)
Results for very unsymmetric matrices
Future work Work underway for a fully unsymmetric version Extend to graph partitioning strategy