Download presentation
Presentation is loading. Please wait.
Published byAlberta Allen Modified over 5 years ago
1
Hodor HPC Cluster LON MNG HPN Head Node Comp Node Comp Node Comp Node
Parallel Storage
2
Hodor HPC Cluster Head Node 2 CPUs – 8 Cores - SandyBridge 64GB RAM
Module Files Open MPI, Intel MPI GCC, Intel COMPILERS SLURM Job Scheduler Head Node HPN MNG LON Comp Node Parallel Storage
3
Hodor HPC Cluster Head Node Must Request an Account Authentication
Campus Connection File Transfer SCP SFTP Globus Compile Very Short Test Runs Do not USE mpirun / mpiexec Head Node HPN MNG LON Comp Node Parallel Storage
4
Hodor HPC Cluster Head Node SSH Keys Are Allowed Must Append new keys
Replacing keys will break your account Must be on-campus to connect Or use Campus VPN Incorrect password attempts will block user access. Head Node HPN MNG LON Comp Node Parallel Storage
5
Hodor HPC Cluster Compute Nodes x 32 2 CPUs – 4 Cores – SandyBridge
16 Nvidia Tesla K20m GPUs 16 Xeon Phi 31P1s Co-processors 64GB RAM Module Files Use “srun” to get Bash Shell Head Node HPN MNG LON Comp Node Parallel Storage
6
Shared File Space /home/user.name /share/apps /cm/shared Not Shared
/tmp Heavy I/O should be done in: Must be copied before end of job. Head Node HPN MNG LON Comp Node Parallel Storage
7
SLURM Queue Information Commands
squeue sinfo qstat –a qstat –n qstat –f
8
SLURM Job Submission Commands
srun – for interactive - use of “screen” sbatch - for batch runs
9
SLURM Job Submission Commands
Examples: /share/apps/slurm/examples sbatch somescript.sh srun –N1 –n --pty bash
10
Important sbatch arguments
#SBATCH –N8 #SBATCH --ntasks-per-node=8 #SBATCH –t 00:10:00 #SBATCH –o ./out_test.txt #SBATCh –e ./err_test.txt
11
Important sbatch environment variables
$SLURM_SUBMIT_DIR Path from current dir when sbatch was invoked. $SLURM_NTASKS Total number of tasks in your job $SLURM_JOB_ID ID number identifying your SLURM job.
12
SLURM Job Delete Commands
scancel #### qdel ####
13
Modulefile Commands module avail module load module list
14
www.crc.und.edu Website: Hodor Help Info
Tutorials & Desktop Software > Linux HPC Cluster (Hodor) Hodor does not have a job submission queue called “test”
15
Get Hodor Account Send to:
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.