Sou-Cheng (Terrya) Choi - IITMeshfree-methods-seminar/presentations/talk_20140521... · I Nightly build I Client-server functionality tests I Load testing I Documentation I Continuous

IntroductionAfter Math

PracticesLast Words

Good Practices for Mathematical Software

Sou-Cheng (Terrya) ChoiNORC at UChicago, IIT www.iit.edu/~schoi32

Meshfree Methods Seminar

Dept. of Applied Mathematics, IIT

Thanks: GAIL Team, MATH573 (Fall 2013)

May 21, 2014

Sou-Cheng Terrya Choi [email protected] Good Practices for Mathematical Software May 21, 2014 1 / 33

www.iit.edu/~schoi32

[email protected]


PracticesLast Words

Memorable Memorial Day!GAIL TeamMission statement

Memorable Memorial Day!

All gave some

Some gave all

More do some

Some do more


[email protected]


PracticesLast Words


The GAIL Company

CEO:Fred Hickernell

EngineeringManager/Mislead

Release Lead: YuhanDing

GAIL sites master:Lan Jiang

Repository Specialist:Yizhi Zhang

DocumentationLead:

Xuan Zhou

Test Lead: TonyJimenez Rugama

Alumni: XinchengSheng


[email protected]


PracticesLast Words


About GAIL—words from the CEO, May 6, 2013I Before GAIL: Automatic numerical integration algorithms have inherent flaws

in their error estimation based on balls of integrands.I GAIL overcomes the flaws by considering cones of integrands. This allows us

to construct upper bounds on costs of our integration routines with rigorousguarantees of accuracy and develop algorithms that provide the value of theintegral with an error of no more than the user-defined tolerance.

I Mission (possible): To create a well-documented and well-tested library ofunivariate & multivariate integration routines that have rigorous guarantees

I GAIL version 1: By the end of the summer we hope to have our automaticroutines for function recovery, univariate integration, and Monte Carloestimation of mean on the GAIL site in good form, meaning that theseroutines need to be

I well-documentedI well-testedI optimized for speedI accompanied by examplesI in a repository where they can be modified and re-tested as needed

I Later we will improve and add to these routines.Sou-Cheng Terrya Choi [email protected] Good Practices for Mathematical Software May 21, 2014 4 / 33

[email protected]


PracticesLast Words


Getting Things Done . . .DONE TODO


[email protected]


PracticesLast Words


GAIL Milestones & Targets

I First-year milestonesI Sep 3, 2013: Release of GAIL version 1I Sep 4, 2013: Development of GAIL version 1.3 commencedI Fall 2013: MATH 573 Seminar/Elective “Reliable Mathematical Software”

(Instructors: Fred Hickernell & C. Students: 7 registered, 2 regular sit-in)I ♥ Feb 14, 2014: Release of GAIL version 1.3I Feb 15: Development of GAIL version 2.0 commenced

I Targets for the next few yearsI July 9–10: SIAM Annual Meeting, Minisymposium on “Reliable Mathematical

Software”I Labor Day, Sep 1: Release of GAIL version 2I Feb 2015: Release of GAIL version 2.5I Sep 2015: Release of GAIL version 3I TBD: Apply for a research grant for GAIL


[email protected]


PracticesLast Words

RecapReproducible ResearchReliable Reproducible Research and Staunch Scientific Software

GAIL algorithms at a glance

Algorithms Developers GAIL functions Versions

Function recovery Yuhan funappx g 1.0–2.0

Univariate integration Yizhi integral g 1.0–2.0

Monte Carlo mean estimation Lan meanMC g 1.0–2.0

meanMCBinomial g 2.0

Multivariate integration Lan cubMC g 1.3–2.0

Univariate optimization Xin funmin g 2.0

Multivariate function recovery Xuan funappxmulti g 2.0

Quasi Monte Carlo Tony cubQMC g 2.0

Multi-level Monte Carlo Aleks cubQMLMC g 2.0


[email protected]


PracticesLast Words


A “disintegrating” integral

0 0.2 0.4 0.6 0.8 1

−0.2

0

0.2

0.4

0.6

0.8

1

fdata

Spiky f with I =∫ 1

0f(x)dx ≈ 0.3694.

quad(f,[0,1],1e-14) = 0 giving error = I.Strategy: ↑ number of points to ↓ error

Q: How many number of points (or function evaluations), n, are re-quired in a quadrature rule to guarantee that a given error tolerance,ε is met with a confidence level 1− α?A: Clancy et al. 2013, [CDH+14b], Hickernell et al. 2014 [HJLO14],GAIL 1.0 [CDH+13], GAIL 1.3 [CDH+14a]


[email protected]


PracticesLast Words


Reproducible research (RR) pioneers and champions

Jon ClaerboutSEP (Stanford Exploration Project,

1973–present)

David DonohoWaveLab (1995), BeamLab (2004),

SparseLab (2007), MCALab (2008)


[email protected]


PracticesLast Words


What is reproducible research (RR)?

I Claerbout, “The markings [ER], [CR], and [NR] are promises by theauthor(s) about the reproducibility of each figure result.” (url:j.mp/VM7Xq4)

I ER: Easily reproducible: “programs, parameters, and makefiles . . . data”I CR: Conditionally reproducible: “processing requires 20 minutes or more, or

commercial packages”I NR: Not reproducible: drawingsI M: Movie “in a figure”; ER, CR, or NR

I Donoho paraphrasing Claerbout, “an article about computational result isadvertising, not scholarship. The actual scholarship is the full softwareenvironment, code and data, that produced the result.” (Buckheit &Donoho 1995)


[email protected]


PracticesLast Words


Examples of RR—ER

Lomask and Fomel 2006


[email protected]


PracticesLast Words


Bernd Flemisch’s survey results (Aug 2013)Online Survey: Reproducibility in ComputationalScience and Engineering (CSE)

13 questions on opinions and experiences concerning the reproducibility ofcomputational results.Results collected on August 1.

Direct EmailsCall went out on July 5 to ∼ 500 addresses.Resulted in ∼ 80 answers.

InterPore NewsletterNewsletter was sent out on July 6 to ∼ 1000 addresses.Resulted in 2 answers.

SIAM Activity Group on CSE Mailing ListCall went out on July 10 to ∼ 2000 addresses.Resulted in ∼ 300 answers.

Survey Results I (n = 385)

0% 20% 40% 60% 80%Yes

No

I understand what the reproducibilityof computational results means:

0% 20%40%60%very important 5

4

3

2

not important 1

I consider the reproducibility ofcomputational results to be ...

0% 20% 40% 60%Yes

No

I don’t know

The importance ... is sufficientlyreflected by today’s journal policies:

0% 10% 20% 30%

very high 5

4

3

2

very little 1

I don’t know

The effort it would take me/others toreproduce my computational results:

0% 20% 40% 60%

Yes

No

I already had problems with reproducingcomputational results of my own/others:

0% 10% 20% 30% 40% 50%More

Equal

Less

Compared to the average, I think that Iinvest ... effort in ensuring reproducibility:

Survey Results II (n = 385)

0% 20% 40% 60%I don’t have/need a strategy

Detailed description in my papers

Use of version control

I make the problem data available

I make the source code available

Use of tools like Madagascar

Other

My strategy to ensure the reproducibility of my comp. results:

0% 20% 40%No personal interest

No scientific interest

No requirement

No reward

No necessity

Other

Reasons for not devoting more effortto foster the reproducibility ...:

0% 10% 20% 30%81 – 100%

61 – 80%

41 – 60%

21 – 40%

0 – 20%

The ratio of working hours I devote to coding(including thinking and talking about coding):

10% 30%Professor

Postdoc

PhD student

BSc/MSc student

Other

My current education/position:

0% 10% 20% 30%> 60

51 – 60

41 – 50

31 – 40

21 – 30

≤ 20

My age in years:

Survey Results: A Slightly Deeper LookThe estimation of the effort to reproduce does not influence the estimationof the importance of reproducibility.The effort estimated for oneself influences the effort estimated for others,and the effort for the others is considered to be higher.The estimated effort to reproduce does not influence the number ofemployed strategy items.The amount of work related to coding influences the estimation of theimportance, but not the number of employed strategy items.Age does not have an influence on the quantitative results, apart from thetime devoted to coding.

5very high

4 3 2 1very little

I don’tknow

0%

10%

20%

30%

The effort it would take me to reproduce my computational results from three years ago:

problems reproducing own results (n = 200)no problems reproducing own results (n = 185)


[email protected]


PracticesLast Words


How reliable or limiting is RR?

I Bounded above by the underlying theory, data, code, and software

I Platform and version dependent

I Less reliable over time due to new software versions

I Lack or loss of data or code; confidential/sensitive data

I Big data inputs hard to clean; outputs corruptable by sharing or transferringprocesses

I Commercial software expensive with many restrictions—Cf. quality freesoftware

I Big (binary) code of poor design or documentation, slow to run

I Lack of testing

I No bug-fix patches or slow new releases

I Lack of communication, community support & feedback

I Hamper creativity and/or productivity?


[email protected]


PracticesLast Words


Reliable RR via Staunch Scientific Software (SSS)

SSS heavyweights

Richard StallmanGNU, Free software movement

Ian FosterFather of the Grid, Globus Online,

Galaxy, SWIFT, CIM-EARTH


[email protected]


PracticesLast Words


Scientific research vs. Software engineeringSciences:

I Wavelets

I Signal processing

I Image processing

I Biostatistics

I PDEs

I Economics

I Physics

I Chemistry

I Mathematics

I Algorithms...

I Monte Carlo simulation

I Numerical integration

Software:

I Test-driven development

I Object oriented design

I API and GUI

I Software reuse

I Logging, error handling

I Paired programming

I Nightly build

I Client-server functionality tests

I Load testing

I Documentation

I Continuous release

I Research project websites

I Licenses (BSD), copyleft


[email protected]


PracticesLast Words

RepositoryInstallationAPIs, input parsing, messagesDocumentationTests

GAIL team working with a remote, central repository

LaninitializedGAIL-devrepository

Everybody clonedthe repository

Yizhi worked onhis local

repository

Yuhan worked onher local

repository

Yizhi publishedhis code

Yuhan could notpublish her code

Yuhan pulledYizhi’s commits

Yuhan resolvedmerge conflicts

Yuhan publishedher code

Pull beforepush

Image credit: https://www.atlassian.com/git/workflows


https://www.atlassian.com/git/workflows

[email protected]


PracticesLast Words


Interacting with our GAIL development repository

Use SourceTree


[email protected]


PracticesLast Words


Managing a repository

Avoid checking in binary files

text text2

text text2

binary binary2

binary binary2

pull

edit

push difference pull

edit

push whole file

“Cleaniness is next to godliness”

Image soure: http://bit.ly/1i4p7by

Credits: XuanSou-Cheng Terrya Choi [email protected] Good Practices for Mathematical Software May 21, 2014 18 / 33

[email protected]


PracticesLast Words


GAIL Central, http://code.google.com/p/gail/

Credits: Lan


http://code.google.com/p/gail/

[email protected]


PracticesLast Words


GAIL home directory & help

One step to install:

>> DownloadInstallGail_1_3_0

The GAIL package is now being downloaded...

>> help GAIL_1_3_0

GAIL_MATLAB

Files

GAILstart - Initialize all the GAIL paths and system parameters.

GAIL_Install - Install GAIL. Add GAIL paths to MATLAB search path.

GAIL_Reinstall - Reinstall GAIL. Remove existing GAIL paths and add new ones.

GAIL_Uninstall - Uninstall GAIL. Remove GAIL paths from MATLAB search path

LICENSE - License of GAIL

README - Installation of and introduction to GAIL.

Folders

Algorithms - GAIL algorithms

Documentation - GAIL documentation

OutputFiles - Output generated by GAIL routines

Papers - Papers and slides related to GAIL

ThirdParty - Open-source tools used but not produced by the GAIL team

UnitTests - Unit tests of GAIL algorithms

Utilities - Tools for the GAIL package

Workouts - Workouts of GAIL algorithms


[email protected]


PracticesLast Words


GAIL’s Application Programming Interfaces (APIs)

Our key algorithms have three API patterns:

[x, out_param] = algo_g(f, in_param);

[x, out_param] = algo_g(f, inputVal1, ..., inputValn );

[x, out_param] = algo_g(f, ’input1’, inputVal1, ..., ’inputn’, inputValn );

f: compulsory, function handlein param, out param: optional, structuresinput1,. . ., inputn: optional, stringinputVal1,. . ., inputValn: optional, numeric

Automatically correct out-of-range input values with warning or error messages to users

Credits: Fred


[email protected]


PracticesLast Words


GAIL help function>> help funappx_g

FUNAPPX_G 1-D guaranteed function recovery on closed interval [a,b]

fappx = FUNAPPX_G(f) recovers function f on the default interval [0,1]

by a piecewise linear interpolant fappx to within the guaranteed

absolute error tolerance of 1e-6. Default initial number of points is

100 and default cost budget is 1e7. Input f is a function handle. The

statement y=f(x) should accept a vector argument x and return a vector

y of function values that is the same size as x.

fappx = FUNAPPX_G(f,a,b,abstol,nlo,nhi,nmax) for given function f and

the ordered input parameters that define the finite interval [a, b], a

guaranteed absolute error tolerance bstol, lower bound of initial

number of points nlo, upper bound of initial number of points nhi, and

cost budget nmax. nlo and nhi can be input as a vector or just one

value as an initial number of points.

fappx = FUNAPPX_G(f,’a’,a,’b’,b,’abstol’,abstol,’nlo’,nlo,’nhi’,nhi,’nmax’,nmax)

recovers function f on the finite interval [a, b], guaranteed absolute

error tolerance abstol, lower bound of initial number of points nlo,

upper bound of initial number of points nhi, and cost budget nmax. All

six field-value pairs are optional and can be supplied in different

order.

fappx = FUNAPPX_G(f,in_param) recovers function f on the finite

interval [in_param.a, in_param.b], guaranteed absolute error tolerance

in_param.abstol, lower bound of initial number of points in_param.nlo,

upper bound of initial number of points in_param.nhi, and cost budget

in_param.nmax. If a field is not specified, the default value is used.

in_param.a --- left end point of interval, default value is 0

.

.

. (More content, omitted)Sou-Cheng Terrya Choi [email protected] Good Practices for Mathematical Software May 21, 2014 22 / 33

[email protected]


PracticesLast Words


GAIL’s searchable HTML documentation

Credits: Yuhan


[email protected]


PracticesLast Words


GAIL doctest for documentation examples

>> doctest funappx_g

TAP version 13

1..9

ok 1 - "f = @(x) x.^2; [fappx, out_param] = funappx_g(f)"

ok 2 - "f = @(x) x.^2;"

ok 3 - "[fappx, out_param] = funappx_g(f,-2,2,1e-7,10,10,1000000)"

ok 4 - "f = @(x) x.^2;"

ok 5 - "[fappx, out_param] = funappx_g(f,’a’,-2,’b’,2,’nhi’,100,’nlo’,10)"

ok 6 - "clear in_param; in_param.a = -10; in_param.b = 10; "

ok 7 - "in_param.abstol = 10^(-7); in_param.nlo = 10; in_param.nhi = 100;"

ok 8 - "in_param.nmax = 10^6; f = @(x) x.^2;"

ok 9 - "[fappx, out_param] = funappx_g(f,in_param)"


[email protected]


PracticesLast Words


MATLAB unit tests

Every time the code is changed, unit tests are run.

Every time a bug is found, unit tests are expanded.

>> results = run(ut_funappx_g)

Running ut_funappx_g

....

Done ut_funappx_g

__________

results =

1x4 TestResult array with properties:

Name

Passed

Failed

Incomplete

Duration

Totals:

4 Passed, 0 Failed, 0 Incomplete.

0.10896 seconds testing time.

classdef ut_funappx_g < matlab.unittest.TestCase

methods(Test)

function funappx_gOfx(testCase)

f = @(x) x;

in_param.tol = 10^(-8);

in_param.tau = 15;

in_param.Nmax = 10^6;

[appxf, result] = funappx_g(f,in_param);

x = sqrt(2)-1;

actualerr = abs(appxf(x)-f(x));

testCase.verifyLessThanOrEqual...

(actualerr,in_param.tol);

testCase.verifyLessThanOrEqual...

(result.npoint,in_param.Nmax);

end

.

.

.(More test cases, omitted)


[email protected]


PracticesLast Words


GAIL 1.3 test suite

>> runtests

TAP version 13

1..9

ok 1 - "f = @(x) x.^2; [fappx, out_param] = funappx_g(f)"

ok 2 - "f = @(x) x.^2;"

ok 3 - "[fappx, out_param] = funappx_g(f,-2,2,1e-7,10,10,1000000)"

ok 4 - "f = @(x) x.^2;"

ok 5 - "[fappx, out_param] = funappx_g(f,’a’,-2,’b’,2,’nhi’,100,’nlo’,10)"

ok 6 - "clear in_param; in_param.a = -10; in_param.b = 10; "

ok 7 - "in_param.abstol = 10^(-7); in_param.nlo = 10; in_param.nhi = 100;"

ok 8 - "in_param.nmax = 10^6; f = @(x) x.^2;"

ok 9 - "[fappx, out_param] = funappx_g(f,in_param)"

...

ok 1 - "f=@(x) sin(x);interval = [1;2];"

ok 2 - "Q = cubMC_g(f,interval,’uniform’,1e-3)"

ok 3 - "f=@(x) exp(-x(:,1).^2-x(:,2).^2);hyperbox = [0 0;1 1];"

ok 4 - "Q = cubMC_g(f,hyperbox,’uniform’,1e-3)"

ok 5 - "d=3;f=@(x) 2^d*prod(x,2)+0.555;hyperbox = [zeros(1,d);ones(1,d)];"

ok 6 - "Q = cubMC_g(f,hyperbox,’uniform’,1e-3)"

0.4842


[email protected]


PracticesLast Words


Measuring and improving performanceMATLAB software development/analysis tools

I Matlab ProfilerI Matlab MexI Matlab 2013 Unit Testing FrameworkI Matlab Central (url: j.mp/anwdaP)I Matlab mex to interface with C/Fortran functionsI Parallelization with SWIFTI Matlab reports


[email protected]


PracticesLast Words

ConclusionsQ & A

Conclusions1. RR is a philosophy that also serves as a practical guide for producing quality

scholarly publications in computational sciences

2. SSS involves a number of software engineering principles and tools that enableexperiments and actions. Open source plays a key role

3. RR is made more reliable by SSS development

4. Numerical quadrature with a guarantee of accuracy is new and worthy

5. GAIL is to demonstrate the theorems and make an impact

6. Eventually and ideally, to be introduced into first NA/SC courses

7. Handling and producing data/code/software taken to the extreme are challengingresearch and practical problems

8. HPC, mobile apps, industrial employment

9. Please cite high-quality software

10. Think a wedge (cone), not a whole (ball)

Image credit: http://bit.ly/19GRS91Sou-Cheng Terrya Choi [email protected] Good Practices for Mathematical Software May 21, 2014 28 / 33

http://bit.ly/19GRS91

[email protected]


PracticesLast Words

ConclusionsQ & A

Wish list

Would you be our Santa?

I User guide

I Stress tests

I Real-world applications

I Automation of HTML help

I Automation of test suites

I Graphical user interfaces to guide user inputs

I Algorithmic usage measurements and citation reminders

I Publish GAIL 2.0 with ACM TOMS


[email protected]


PracticesLast Words

ConclusionsQ & A

Readings

About GAIL: [CDH+14b, HJLO14, CDH+14a]

About reliable reproducible research via staunch / sustainable scientific software:[CH13, Cho13, KCL+14]


[email protected]


PracticesLast Words

ConclusionsQ & A

Questions & Feedback

“God does not care about our mathematical difficulties. He integratesempirically.”—Albert Einstein

Thank you!


[email protected]

References I

Sou-Cheng T. Choi, Yuhan Ding, Fred J. Hickernell, Lan Jiang, and YizhiZhang, GAIL: Guaranteed Automatic Integration Library (version 1),MATLAB software, 2013.

, GAIL: Guaranteed Automatic Integration Library (version 1.3),MATLAB software, 2014.

N. Clancy, Y. Ding, C. Hamilton, F. J. Hickernell, and Y. Zhang, Thecomplexity of guaranteed automatic algorithms: Cones, not balls, Journal ofComplexity 30 (2014), 21–45.

Sou-Cheng T. Choi and Fred J. Hickernell, IIT MATH-573 ReliableMathematical Software, 2013, seminal course slides.

Sou-Cheng T. Choi, MINRES-QLP Pack and reliable reproducible research viastaunch scientific software, First Workshop on Sustainable Software forScience: Practice and Experiences (Denver, Colorado, USA), 2013.


[email protected]

References II

F. J. Hickernell, L. Jiang, Y. Liu, and A. B. Owen, Guaranteed conservativefixed width confidence intervals via Monte Carlo sampling, Monte Carlo andQuasi-Monte Carlo Methods 2012 (J. Dick, F. Y. Kuo, G. W. Peters, andI. H. Sloan, eds.), Springer-Verlag, Berlin, 2014, to appear, arXiv:1208.4318[math.ST].

Daniel S. Katz, Sou-Cheng T. Choi, Hilmar Lapp, Ketan Maheshwari, FrankLoffler, Matthew Turk, Marcus Hanwell, Nancy Wilkins-Diehr, JamesHetherington, James Howison, Shel Swenson, Gabrielle D. Allen, Anne C.Elster, Bruce Berriman, and Colin Venters, Summary of the First Workshopon Sustainable Software for Science: Practice and Experiences (WSSSPE1),Technical report, 2014, submitted to the Journal of Open Research Software.


[email protected]

Sou-Cheng (Terrya) Choi - IITMeshfree-methods-seminar/presentations/talk_20140521... · I Nightly build I Client-server functionality tests I Load testing I Documentation I Continuous

Documents