
CISL Mission for User Support - CESM · CISL Mission for User Support CISL also supports special colloquia, workshops and computational campaigns giving users special privileges and

Mar 31, 2020


dariahiddleston

CISL Mission for User Support

CISL also supports special colloquia, workshops, and computational campaigns, giving users special privileges and access to facilities and services above normal service levels.

"CISL will provide a balanced set of services to enable researchers to utilize community resources securely, easily, and effectively." (CISL Strategic Plan)


CISL Facilities Overview

- HPC Systems
  - Yellowstone
  - Cheyenne
- Data Storage and Archival
  - GLADE
  - HPSS
- Data Analysis and Visualization
  - Geyser and Caldera
- Allocations
- Additional Training Resources
- Contacting User Support
- Questions


Yellowstone


Yellowstone - IBM iDataPlex

- Compute Nodes
  - 4,518 compute nodes
  - 16 cores, 2.6 GHz Intel Sandy Bridge per node
  - 32 GB 1600 MHz DDR3 memory per node
  - 72,288 total cores, 144 TB total memory
  - 6 login nodes (yslogin1, yslogin2, … yslogin6)
- Interconnect
  - Mellanox FDR InfiniBand network
  - Fat-tree topology
- Uses GLADE file system
- Available through 2017
- More info on Yellowstone:
  - https://www2.cisl.ucar.edu/resources/computational-systems/yellowstone


Cheyenne


Cheyenne – SGI ICE XA

- Compute Nodes
  - 4,032 compute nodes
  - 36 cores, 2.3 GHz Intel Broadwell per node
  - 2400 MHz DDR4 memory per node
    - 64 GB on 3,168 nodes
    - 128 GB on 864 nodes
  - 145,152 total cores, 313 TB total memory
  - 6 login nodes (cheyenne1, cheyenne2, … cheyenne6)
- Interconnect
  - Mellanox EDR InfiniBand fabric
  - 9D hypercube topology
- Uses GLADE file system
- Available through ~2020
- More info on Cheyenne:
  - https://www2.cisl.ucar.edu/resources/computational-systems/cheyenne
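The Cheyenne totals above follow from the per-node figures. A small shell check, assuming decimal units (1 TB = 1000 GB); the function names are illustrative and not part of any CISL tool:

```shell
# Total cores = nodes x cores per node.
total_cores() {
    echo $(( $1 * $2 ))
}

# Total memory in GB for a mix of node types, given as "count:GB" pairs.
total_mem_gb() {
    sum=0
    for pair in "$@"; do
        count=${pair%%:*}   # text before the colon: number of nodes
        gb=${pair##*:}      # text after the colon: GB per node
        sum=$(( sum + count * gb ))
    done
    echo "$sum"
}

total_cores 4032 36              # 145152 cores
total_mem_gb 3168:64 864:128     # 313344 GB, about 313 TB
```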


GLADE Shared File System

GLADE is a high performance GPFS file system shared across all CISL HPC and DAV computers

File space                 Quota    Backed up   Purge period   Description
/glade/u/home/username     25 GB    Yes         None           Home directories
/glade/scratch/username    10 TB    No          45 days        Temporary space for short term use
/glade/p/work/username     512 GB   No          None           Work space for longer term storage
/glade/p/project_code      N/A      No          None           Shared space for project allocations
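Because scratch space is purged after 45 days, it can help to list files that are getting old before they disappear. A minimal sketch, assuming the purge is keyed to modification time (the slides do not say which timestamp is used); the function name is hypothetical:

```shell
# Print regular files under DIR whose data was last modified more than DAYS days ago.
# find -mtime counts age in whole 24-hour periods, so +38 matches files 39 or more days old.
files_older_than() {
    find "$1" -type f -mtime +"$2" -print
}

# Example: scratch files within about a week of a 45-day purge
# files_older_than "/glade/scratch/$USER" 38
```

Files worth keeping can then be moved to work space or archived to HPSS.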


System Access with Yubikey

- Use Secure Shell (ssh) to log on with your Yubikey
  - Terminal, Cygwin, PuTTY, etc.
- Using your Yubikey token
  - When you log in to Cheyenne with
        ssh -X <username>@cheyenne.ucar.edu
    your screen displays the prompt:
        Token_Response:
  - Enter your PIN at the prompt (do not press Enter), then touch the Yubikey button. This inserts a new one-time password (OTP) followed by a return.
  - The Yubikey is activated by a light touch; you do not need to press hard.
- More information on Yubikey: http://www2.cisl.ucar.edu/docs/yubikeys


Supported Shells

- Two primary (supported) shells
  - tcsh
  - bash
- The bash shell is the default for new accounts; tcsh may be carried over from previous accounts
- Change your default shell in SAM (System Account Manager)
  - https://sam.ucar.edu
- More information: http://www2.cisl.ucar.edu/resources/yellowstone/quick_start


Machine Specific Start Files (bash)

- Settings for ~/.profile:

    alias rm="rm -i"
    PS1="\u@\h:\w> "
    if [[ $HOSTNAME == yslogin* ]]; then
        # Yellowstone settings
        alias bstat="bjobs -u all"
    else
        # Cheyenne settings
        alias qjobs="qstat -u $USER"
    fi


Machine Specific Start Files (tcsh)

- Settings for ~/.tcshrc:

    # Apply interactive settings only when attached to a terminal
    tty > /dev/null
    if ( $status == 0 ) then
        alias rm "rm -i"
        set prompt = "%n@%m:%~"
        if ( $HOSTNAME =~ yslogin* ) then
            # Yellowstone settings
            alias bstat "bjobs -u all"
        else
            # Cheyenne settings
            alias qjobs "qstat -u $USER"
        endif
    endif


Login Node Usage

- The login nodes are shared by all HPC users, and multiple user workloads run on each login node
- Your programs compete with those of other users for cores and memory
- Login node usage should be limited to:
  - Reading and writing text/code
  - Compiling programs
  - Performing small data transfers
  - Interacting with the job scheduler
- Programs that use excessive resources on the login nodes will be terminated
- More intensive work should be run on the shared nodes, DAV nodes, or the compute nodes


Supported Compilers

CISL supports multiple compilers on HPC systems:

- Intel C, C++, Fortran
  - icc, icpc, ifort, mpiicc, mpiicpc, mpiifort
- GNU C, C++, Fortran
  - gcc, g++, gfortran, mpigcc
- Portland Group C, C++, Fortran (Yellowstone only)
  - pgcc, pgCC, pgfortran, pgf90, mpipcc, mpipf90

More information: http://www2.cisl.ucar.edu/resources/yellowstone/code_dev/compiling
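Which of these compilers is usable depends on what is currently on your PATH. A small sketch that reports the ones it finds; the function name is hypothetical, and it relies only on the standard `command -v` lookup:

```shell
# Print each named command that is found on the current PATH, one per line.
available_compilers() {
    for c in "$@"; do
        if command -v "$c" >/dev/null 2>&1; then
            printf '%s\n' "$c"
        fi
    done
}

# Check the serial compilers listed above.
available_compilers icc icpc ifort gcc g++ gfortran pgcc pgfortran
```

An empty result for a compiler family usually means the corresponding module is not loaded in the current session.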


Supported MPI Libraries

- Multiple MPI libraries are available on Cheyenne
  - SGI MPT
  - Intel MPI
  - OpenMPI
  - MPICH
- The MPI libraries available depend on the compiler you are using
- Switching compilers and libraries (and other software) is done using the module command


Commonly Used Software

- BLAS - Basic Linear Algebra Subroutines
- HDF5 - Hierarchical Data Format
- LAPACK and ScaLAPACK
- MKL - Math Kernel Library of general-purpose math routines
- GSL - GNU Scientific Library for C and C++ programmers
- NetCDF - Network Common Data Form
- PnetCDF - Parallel netCDF
- NCL - NCAR Command Language
- CDO - Climate Data Operators
- IDL - Interactive Data Language
- R - Statistical Computing Environment
- Python - Scripting Language
- Matlab - High Level Interactive Mathematical Environment


Using Modules

- Modules help manage user software, including compilers, libraries, and the dependencies between them.
- The software available through modules is hierarchical and organized in a tree. The software you have loaded determines which branch of the tree you are in.
- The primary dependencies are due to compilers and MPI libraries.
- Software available within a branch should be mutually compatible.


Using Modules

Helpful module commands:

- module av
  - list available modules
- module list
  - show the modules currently loaded
- module load/unload <module-name>
  - load module-name into, or unload it from, the environment
- module swap <module 1> <module 2>
  - swap module 1 for module 2
- module help
  - display help on module commands
- module help <module-name>
  - display help specific to module-name


Using Modules

Helpful module commands (continued):

- module whatis <module-name>
  - short info on module-name
- module save <set-name>
  - save the currently loaded module set as set-name
- module restore <set-name>
  - reload all modules in saved set set-name
- module purge
  - remove all loaded modules from the environment
- module reset
  - reset the module environment

More Info: http://www2.cisl.ucar.edu/resources/software/modules
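A typical use of these commands is switching compilers and then saving the result. A minimal sketch; the function name, the module names intel and gnu, and the set name my_gnu_env are assumptions for illustration, so check `module av` for what is actually installed:

```shell
# Swap one compiler module for another, then show what is loaded.
# Runs in the current shell (not a subshell) because module changes the
# caller's environment.
switch_compiler() {
    if ! command -v module >/dev/null 2>&1; then
        echo "module command not found on this host" >&2
        return 1
    fi
    module swap "$1" "$2" && module list
}

# Example:
# switch_compiler intel gnu
# module save my_gnu_env       # keep this set for later sessions
# module restore my_gnu_env    # reload it after a new login
```

Because the module tree is hierarchical, swapping the compiler generally changes which MPI libraries and dependent packages are offered.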


Job Control With PBS

- Job submission (qsub)
  - qsub script_name
- Job monitoring (qstat)
  - qstat
  - qstat <job id>
  - qstat -u $USER
  - qstat -Q <queue>
- Job removal (qdel)
  - qdel <job id>
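The three commands above are usually used together: qsub prints the new job's ID, which qstat and qdel then take as an argument. A short sketch; the function name and the script name in the example are hypothetical:

```shell
# Submit a PBS script, report the job ID that qsub prints, and show its status.
submit_and_show() {
    jobid="$(qsub "$1")" || return 1
    echo "submitted: $jobid"
    qstat "$jobid"
}

# Example:
# submit_and_show run_wrf.pbs
# qdel "$jobid"    # remove the job if it is no longer needed
```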


Example PBS Job Script

    #!/bin/bash
    #PBS -N WRF_PBS
    #PBS -A <project>
    #PBS -q regular
    #PBS -l walltime=00:30:00
    #PBS -l select=4:ncpus=36:mpiprocs=36:ompthreads=1
    #PBS -j oe
    #PBS -o log.oe

    # Run WRF with SGI MPT
    mpiexec_mpt ./wrf.exe

For more info: https://www2.cisl.ucar.edu/resources/computational-systems/cheyenne/running-jobs/submitting-jobs-pbs
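The select line in the example asks for 4 nodes with 36 MPI ranks on each, so the job runs 144 ranks in total. A small sketch of that arithmetic, assuming 36-core nodes as on Cheyenne; the function names are illustrative:

```shell
# Build a select directive for NODES nodes with RANKS MPI ranks on each
# 36-core node and one OpenMP thread per rank, matching the example above.
pbs_select() {
    echo "#PBS -l select=$1:ncpus=36:mpiprocs=$2:ompthreads=1"
}

# Total MPI ranks launched = nodes x ranks per node.
total_ranks() {
    echo $(( $1 * $2 ))
}

pbs_select 4 36     # the directive used in the example script
total_ranks 4 36    # 144 ranks
```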


HPSS


HPSS Introduction

- High Performance Storage System (100+ PB of storage)
- Hierarchical Storage Interface (HSI) is the primary interface for data transfer to/from HPSS, along with metadata access and data management.
- HPSS Tape Archiver (HTAR) is used to package files on your file system into a single archive file and then send it to HPSS.
- HPSS is used for long term archiving of files, not for short term temporary storage


Hierarchical Storage Interface (HSI)

- POSIX-like interface
- Different ways to invoke HSI
  - Command line invocation
    - hsi cmd
    - hsi cget hpssfile (from your default dir on HPSS)
    - hsi cput myfile (to your default dir on HPSS)
  - Open an HSI session
    - hsi to start a session; end, exit, or quit to stop the session
    - restricted shell-like environment
  - hsi "in cmdfile"
    - File of commands scripted in "cmdfile"
- Navigating HPSS while in an HSI session
  - On the HPSS file system: pwd, cd, ls, mkdir, cdls
  - On the GLADE file system: lpwd, lcd, lls, lmkdir, lcdls
- More info: http://www2.cisl.ucar.edu/docs/hpss/hsi
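The `hsi "in cmdfile"` form is convenient for scripted transfers. A minimal sketch that writes such a command file using only the cd and cput commands named above; the function name and the file and directory names are hypothetical:

```shell
# Write an HSI command file that changes to an HPSS directory and
# stores each named local file there with cput.
write_hsi_cmdfile() {
    cmdfile="$1"
    hpssdir="$2"
    shift 2
    {
        printf 'cd %s\n' "$hpssdir"
        for f in "$@"; do
            printf 'cput %s\n' "$f"
        done
    } > "$cmdfile"
}

# Example:
# write_hsi_cmdfile transfer.hsi results run1.nc run2.nc
# hsi "in transfer.hsi"
```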


Data Transfer Individual Files

- Writing data: cput command
  - [HSI]/home/user1-> cput file.01
  - [HSI]/home/user1-> cput file.01 : new.hpss.file
- Reading data: cget command
  - [HSI]/home/user1-> cget file.01
  - [HSI]/home/user1-> cget file.01 : hpss.file
- Can also use wildcards for cget/cput
  - [HSI]/home/user1-> cget *.nc


Data Transfer: HTAR for File Collections

- Use HTAR for large numbers of files or file trees
  - htar -c -f ex_dir.tar ex_dir
- To list the contents of an HTAR archive
  - htar -t -f ex_dir.tar
- Can add/retrieve files to/from an HTAR archive without downloading the whole archive

More Info: https://www2.cisl.ucar.edu/docs/hpss/htar
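The create and list commands pair naturally: archive a directory, then list the archive to confirm what was stored. A short sketch; the function name is hypothetical, and the single-file extraction in the comment uses htar's -x option with an assumed member path, so check the HTAR documentation linked above before relying on it:

```shell
# Archive a directory tree to HPSS as DIR.tar, then list the archive
# to confirm what was stored.
archive_dir() {
    htar -c -f "$1.tar" "$1" && htar -t -f "$1.tar"
}

# Example:
# archive_dir ex_dir
# htar -x -f ex_dir.tar ex_dir/file1    # retrieve one member only
```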


HPSS Help

- To get help with an HSI command that takes at least one argument, type the command name without arguments
  - [HSI]/home/user1-> cput
- To get help with an HSI command that takes no arguments, type the command followed by -?
  - [HSI]/home/user1-> ls -?


Data Analysis and Visualization

- Geyser
  - 16 large memory nodes
  - 40 cores, 2.4 GHz Intel Westmere EX per node
  - 1 NVIDIA Quadro K5000 GPU per node
  - 1 TB 1600 MHz DDR3 memory per node
- Caldera
  - 30 visualization / compute nodes
  - 16 cores, 2.6 GHz Intel Sandy Bridge per node
  - 2 NVIDIA Tesla K20X GPUs per node (some nodes)
  - 64 GB 1600 MHz DDR3 memory per node
- Currently not available from Cheyenne, but will be soon


Data Analysis and Visualization

- Data Analysis and Visualization
  - High-end servers available 7 x 24 for interactive data analysis, data post-processing, and visualization
- Data Sharing
  - Shared data access within the lab
  - Access to the NCAR archival systems and NCAR data sets
- Remote Visualization
  - Access to visual computing platforms from your laptop
- Visualization Consulting
  - Consult with CISL staff on your visualization problems

More info: https://www2.cisl.ucar.edu/resources/geyser_caldera/visualization


Allocations and Account Info

- For accounting info and allocations, log on to the Systems Account Manager (SAM): https://sam.ucar.edu
- Change shells
- Change default Unix group
- Change HPSS default project
- See active projects, charges, and balances

More Info: https://www2.cisl.ucar.edu/docs/sam


Additional Training

- The Consulting Group provides classes on topics of interest to CISL HPC users
  - Fortran, C, C++
  - Python
  - NCL, IDL, Matlab
  - Parallel Computing with MPI and OpenMP
  - Visualization
  - Linux and UNIX Commands and System Use
- Classes are also archived for web viewing
- https://www2.cisl.ucar.edu/training


Contacting User Support

- CISL Homepage:
  - http://www2.cisl.ucar.edu/
- CISL Consulting Services
  - NCAR Mesa Lab Area 55, Floor 1B
- CISL HELP
  - Call (303) 497-2400
  - Email to [email protected]
  - Submit an Extraview ticket


Questions?


Job Control With LSF

- Job submission (bsub)
  - bsub < script
  - bsub -U CESM_WS < script
    - Submit a job against reservation CESM_WS
- Job monitoring (bjobs and bhist)
  - bjobs
  - bjobs -u all
  - bjobs -q regular
  - bhist
  - bhist -n <#> jobid
- Job removal (bkill)
  - bkill jobid


Example LSF Job Script

    #!/bin/csh
    # LSF batch script to run an MPI application
    #
    #BSUB -P UESM0002            # project code
    #BSUB -W 01:20               # wall-clock time (hrs:mins)
    #BSUB -n 64                  # number of MPI tasks in job
    #BSUB -R "span[ptile=16]"    # run 16 MPI tasks per node
    #BSUB -J myjob               # job name
    #BSUB -o myjob.%J.out        # output file name; %J is job ID
    #BSUB -e myjob.%J.err        # error file name; %J is job ID
    #BSUB -q regular             # queue

    # run the executable
    mpirun.lsf ./myjob.exe

For more info: http://www2.cisl.ucar.edu/resources/yellowstone/using_resources/runningjobs
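In the LSF example, -n gives the total number of MPI tasks and ptile gives the tasks per node, so the job occupies 64 / 16 = 4 nodes. A small sketch of that calculation, rounding up for task counts that do not divide evenly; the function name is illustrative:

```shell
# Nodes needed for TASKS MPI tasks at PTILE tasks per node (rounded up).
lsf_nodes() {
    echo $(( ($1 + $2 - 1) / $2 ))
}

lsf_nodes 64 16    # 4 nodes, as in the example script
```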