Top Banner
A Comparison of Several Direct Sparse Linear Equation Solvers for CGWAVE on the Cray X1 Fred T. Tracy and Thomas C. Oppe
48

A Comparison of Several Direct Sparse Linear Equation ...

May 25, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A Comparison of Several Direct Sparse Linear Equation ...

A Comparison of Several Direct Sparse Linear Equation Solvers for CGWAVE on the Cray X1

Fred T. Tracy and Thomas C. Oppe

Page 2: A Comparison of Several Direct Sparse Linear Equation ...

Mild Slope Equation

( )

( )( )kdgk

kdkd

n

nCk

C

kC

CC

CC

g

gg

tanh

2sinh2

121

0ˆˆ

2

2

=

+=

=∂∂

=

=

=+⋅∇

σ

σ

σ

ηση

depth localnumberwave

velocitygroupvelocityphase

frequencywavefunctionelevationsurfacecomplexˆ

==

===

=

dk

CC

g

σ

η

Page 3: A Comparison of Several Direct Sparse Linear Equation ...

CGWAVE

Land

Computational domain

Open boundary

Exterior

Incident wave direction

Page 4: A Comparison of Several Direct Sparse Linear Equation ...

CGWAVE

• Wave prediction model• Harbors, open coastal regions, coastal inlets,

around islands, and around fixed or floating structures

• Finite element model• Unstructured mesh• Linear system of non-symmetric, complex

equations• Very difficult to solve by iterative methods

Page 5: A Comparison of Several Direct Sparse Linear Equation ...

Direct Solvers Compared

• Bansol• SSGETRF, SSGETRS• SSTSTRF, SSTSTRS• SuperLU• UMFPACK

bAx =

Page 6: A Comparison of Several Direct Sparse Linear Equation ...

Bansol

• Out-of-core• Banded• FORTRAN• Initial bandwidth reduction step• Complex A and b• Threshold pivoting• Directives were used for optimization on

the X1

Page 7: A Comparison of Several Direct Sparse Linear Equation ...

Reordering

1 2

3 4

Page 8: A Comparison of Several Direct Sparse Linear Equation ...

Reordering

1 2

3 4

5 6

Page 9: A Comparison of Several Direct Sparse Linear Equation ...

Reordering

1 2

3 4

5 6

7

Page 10: A Comparison of Several Direct Sparse Linear Equation ...

Reordering

1 2

3 4

5 6

7 8

9 10

Page 11: A Comparison of Several Direct Sparse Linear Equation ...

Node Number Reordering

• Before directive

• After directive

r V--<> temp(nos(1:nn)) = phi(1:nn)V M--<> phi(1:nn) = temp(1:nn)

!dir$ concurrentr V M--<> temp(nos(1:nn)) = phi(1:nn)V M----<> phi(1:nn) = temp(1:nn)

Page 12: A Comparison of Several Direct Sparse Linear Equation ...

Store into Blocks1 !csd$ parallel do private (jj, ii, 2 !csd$& nold, jold, node, i, j)1 M-----< do jj = 1, nm1 M MV--< do ii = 1, nn1 M MV nold = nns(ii)1 M MV jold = id(nold, jj)1 M MV if (jold .ne. 0) then1 M MV node = nos(jold)1 M MV if ((node .ge. jst) .and.

& (node .le. jend)) then1 M MV i = ii - node + nband1 M MV j = node - jst + 11 M MV aa(i, j) = a(nold, jj)1 M MV end if1 M MV end if1 M MV--> end do1 M-----> end do1 !csd$ end parallel do

Page 13: A Comparison of Several Direct Sparse Linear Equation ...

SSGETRF, SSGETRS

• General unsymmetric A• Sparse• Real A and b• Threshold pivoting• Optimized by Cray in SciLib

Page 14: A Comparison of Several Direct Sparse Linear Equation ...

SSGETRF, SSGETRS Steps

• Fill-reduction reordering• Symbolic factorization• Execution sequence and memory

management• Numerical factorization• Back substitution

Page 15: A Comparison of Several Direct Sparse Linear Equation ...

SSTSTRF, SSTSTRS

• Unsymmetric A with symmetric structure (typical FEM data)

• Sparse• Real A and b• No pivoting• Optimized by Cray in SciLib

Page 16: A Comparison of Several Direct Sparse Linear Equation ...

SSTSTRF, SSTSTRS Steps

• Fill-reduction reordering• Symbolic factorization• Execution sequence and memory

management• Numerical factorization• Back substitution

Page 17: A Comparison of Several Direct Sparse Linear Equation ...

SuperLU

• General unsymmetric A• Sparse• C• Real A and b• Threshold pivoting• No optimization on the X1

Page 18: A Comparison of Several Direct Sparse Linear Equation ...

SuperLU Steps

• Equilibrate• Preorder the rows of A• Order the columns of A• Numerical factorization• Back substitution

Page 19: A Comparison of Several Direct Sparse Linear Equation ...

UMFPACK

• General unsymmetric A• Sparse• Multifrontal method• Approximate Minimum Degree ordering• C• Real A and b• Threshold pivoting• No optimization on the X1

Page 20: A Comparison of Several Direct Sparse Linear Equation ...

UMFPACK Steps

• Preorder and symbolic analysis• Numerical factorization• Back substitution

Page 21: A Comparison of Several Direct Sparse Linear Equation ...

CGWAVE Data Sets

108.00.0288596.0Maxi |(b)i|

2.920.0004437.58Maxi |(xbm)i|

5831,487719New half BW

1,8291,487719Old half BW

496,286265,119130,255Nodes

event43p11run24aData Set ID

Page 22: A Comparison of Several Direct Sparse Linear Equation ...

Bansol – X1

316.2311.679.61.0286.3300.964.00.1

BS IO

12.210.96.71.012.011.66.70.1

BS comp.

678.9773.5194.21.0581.5762.9141.70.1

LU IO

413.91,415.9187.81.0415.81,416.1189.10.1

LU comp.

246.1317.673.7Wr. blocks3.21.60.7Band. red.

event43p11run24aTh

Page 23: A Comparison of Several Direct Sparse Linear Equation ...

Bansol – X1 (Th = 0.1)

0

500

1000

1500

2000

2500

3000

a p11run24 event43

Tim

e (s

ec) BS IO

BS computationLU IOLU computationWriting blocks

Page 24: A Comparison of Several Direct Sparse Linear Equation ...

Bansol – X1

1.13(10-12)

1.00(10-15)

1.38(10-11)

1.0

1.13(10-12)

1.25(10-15)

1.41(10-11)

0.1Maxi

|(b-Ax)i|

5.07(10-11)

1.90(10-15)

5.60(10-13)

1.0

5.07(10-11)

1.88(10-15)

5.59(10-13)

0.1Maxi

|(x-xbm)i|

1,665.62,826.9538.71.01,543.52,812.4478.70.1

Total

event43p11run24aTh

Page 25: A Comparison of Several Direct Sparse Linear Equation ...

Bansol – O3K

716.71,761.8221.81.0680.01,443.1220.60.1

BS IO

141.1211.948.71.0139.2216.549.50.1

BS comp.

794.02,128.1240.61.0702.02,134.5141.70.1

LU IO

2,256.419,145.31,313.01.02,264.719,130.71,316.00.1

LU comp.

379.8348.279.7Wr. blocks0.50.30.1Band. red.

event43p11run24aTh

Page 26: A Comparison of Several Direct Sparse Linear Equation ...

Bansol – O3K (Th = 0.1)

0

5000

10000

15000

20000

25000

a p11run24 event43

Tim

e (s

ec) BS IO

BS computationLU IOLU computationWriting blocks

Page 27: A Comparison of Several Direct Sparse Linear Equation ...

Bansol – O3K

1.20(10-12)

1.37(10-15)

1.61(10-11)

1.0

1.20(10-12)

1.52(10-15)

1.80(10-11)

0.1Maxi

|(b-Ax)i|

5.07(10-11)

1.90(10-15)

5.58(10-13)

1.0

5.07(10-11)

1.88(10-15)

5.55(10-13)

0.1Maxi

|(x-xbm)i|

4,295.223,596.61,909.91.04,158.723,271.71,893.40.1

Total

event43p11run24aTh

Page 28: A Comparison of Several Direct Sparse Linear Equation ...

Bansol Comparison

0

5000

10000

15000

20000

25000

a p11run24 event43

Tim

e (s

ec)

X1O3K

Page 29: A Comparison of Several Direct Sparse Linear Equation ...

SSGETRF/SSGETRS – X1

142.465.418.7Numerical LU

534.0152.273.4Total1.26

(10-10)4.37

(10-15)1.09

(10-10)Maxi

|(b-Ax)i|

10.95.82.9BS

77.939.721.4Preparation LU303.641.730.6Set up data

event43p11run24aTh = 0.1

Page 30: A Comparison of Several Direct Sparse Linear Equation ...

SSGETRF/SSGETRS – X1

0

100

200

300

400

500

600

a p11run24 event43

Tim

e (s

ec) BS

Numerical LUPrep. LUSet up data

Page 31: A Comparison of Several Direct Sparse Linear Equation ...

SSTSTRF/SSTSTRS – X1

231.9106.433.1Numerical LU

610.8185.083.4Total2.13(10-8)

1.73(10-14)

3.82(10-9)

Maxi

|(b-Ax)i|

10.35.42.7BS

63.531.717.5Preparation LU305.041.530.1Set up data

event43p11run24a

Page 32: A Comparison of Several Direct Sparse Linear Equation ...

SSTSTRF/SSTSTRS – X1

0

100

200

300

400

500

600

700

a p11run24 event43

Tim

e (s

ec) BS

Numerical LUPrep. LUSet up data

Page 33: A Comparison of Several Direct Sparse Linear Equation ...

General vs. Symmetric Structure Comparison

0

100

200

300

400

500

600

700

General Symmetric Structure

Tim

e (s

ec) BS

Numerical LUPrep. LUSet up data

Page 34: A Comparison of Several Direct Sparse Linear Equation ...

General vs. Symmetric Structure Comparison

0

100

200

300

400

500

600

General Symmetric Structure

Error (X 10^-12)

Page 35: A Comparison of Several Direct Sparse Linear Equation ...

SuperLU – X1

1,690.8710.5242.7Total5.10

(10-10)4.53

(10-15)2.66

(10-10)Maxi

|(b-Ax)i|

65.436.313.2BS1,322.4633.2199.4LU305.941.030.1Set up data

event43p11run24aTh = 0.1

Page 36: A Comparison of Several Direct Sparse Linear Equation ...

SuperLU – X1

0

300

600

900

1200

1500

1800

a p11run24 event43

Tim

e (s

ec)

BSLUSet up data

Page 37: A Comparison of Several Direct Sparse Linear Equation ...

SuperLU – O3K

2,171.5612.697.8Total5.51

(10-11)54.7

(10-16)3.56

(10-10)Maxi

|(b-Ax)i|

443.36.12.2BS1,706.7603.493.8LU

19.12.91.8Set up dataevent43p11run24aTh = 0.1

Page 38: A Comparison of Several Direct Sparse Linear Equation ...

SuperLU – O3K

0

500

1000

1500

2000

2500

a p11run24 event43

Tim

e (s

ec)

BSLUSet up data

Page 39: A Comparison of Several Direct Sparse Linear Equation ...

SuperLU Comparison

0

500

1000

1500

2000

2500

a p11run24 event43

Tim

e (s

ec)

X1O3K

Page 40: A Comparison of Several Direct Sparse Linear Equation ...

UMFPACK – X1

496.5240.571.0Numerical LU

1,105.9431.2151.0Total5.43

(10-10)5.06

(10-15)1.85(10-9)

Maxi

|(b-Ax)i|

9.14.21.7BS

294.6143.947.7Preparation LU305.242.130.4Set up data

event43p11run24aTh = 0.1

Page 41: A Comparison of Several Direct Sparse Linear Equation ...

UMFPACK – X1

0

200

400

600

800

1000

1200

a p11run24 event43

Tim

e (s

ec) BS

Numerical LUPrep. LUSet up data

Page 42: A Comparison of Several Direct Sparse Linear Equation ...

UMFPACK – O3K

325.8146.327.5Numerical LU

363.1163.033.8Total1.47(10-9)

1.91(10-14)

3.20(10-9)

Maxi

|(b-Ax)i|

4.01.90.7BS

28.613.64.9Preparation LU4.71.20.7Set up data

event43p11run24aTh = 0.1

Page 43: A Comparison of Several Direct Sparse Linear Equation ...

UMFPACK – O3K

0

50

100

150

200

250

300

350

400

a p11run24 event43

Tim

e (s

ec) BS

Numerical LUPrep. LUSet up data

Page 44: A Comparison of Several Direct Sparse Linear Equation ...

UMFPACK Comparison

0

200

400

600

800

1000

1200

a p11run24 event43

Tim

e (s

ec)

X1O3K

Page 45: A Comparison of Several Direct Sparse Linear Equation ...

Comparison of Solvers – X1

0

500

1000

1500

2000

2500

3000

a p11run24 event43

Tim

e (s

ec) Bansol

SSGETRFSSTSTRFSuperLUUMFPACK

Page 46: A Comparison of Several Direct Sparse Linear Equation ...

Comparison of Solvers – O3K

0

5000

10000

15000

20000

25000

a p11run24 event43

Tim

e (s

ec)

BansolSuperLUUMFPACK

Page 47: A Comparison of Several Direct Sparse Linear Equation ...

Accuracy – X1 (Infinity Norm)

1.13126

2130

510 543

0

500

1000

1500

2000

2500

event43

Err

or (

X 1

0^-1

2) BansolSSGETRFSSTSTRFSuperLUUMFPACK

Page 48: A Comparison of Several Direct Sparse Linear Equation ...

Questions?