Debrecen2 GPU cluster

ssh USER@login.debrecen2.hpc.niif.hu

Up: scp FILE USER@login.debrecen2.hpc.niif.hu:FILE
Down: scp USER@login.debrecen2.hpc.niif.hu:FILE FILE

Up: rsync -a -e ssh DIRECTORY USER@login.debrecen2.hpc.niif.hu:/home/USER
Down: rsync -a -e ssh USER@login.debrecen2.hpc.niif.hu:/home/USER/DIRECTORY .

               short form of CWD
                     |
    DEBRECEN2[login] ~ (0)$
        |       |       |
   HPC station  |       |
    short machine name  |
               exit code of the previous command

module avail

module list

module load APP

setfacl -m u:OTHER:rx $HOME

setfacl -m u:OTHER:rxw $HOME/DIRECTORY

getfacl $HOME/DIRECTORY

/mnt/fhgfs/home/$USER

rsync -avuP --delete $HOME/DIRECTORY /mnt/fhgfs/home/$USER

sbalance

Scheduler Account Balance
---------- ----------- + ---------------- ----------- + ------------- -----------
User             Usage |          Account       Usage | Account Limit   Available (CPU hrs)
---------- ----------- + ---------------- ----------- + ------------- -----------
bob *                7 |           foobar           7 |         1,000         993
alice                0 |           foobar           7 |         1,000         993

sestimate -N NODES -t WALLTIME
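Before calling sestimate it can help to have a rough figure in mind. The following is a minimal sketch, not the scheduler's own formula: it assumes that usage is counted as cores × wall-clock hours, that whole nodes are reserved, and that a node has 16 cores (8 × 2, as in the hardware table). The helper name core_hours is hypothetical.

```shell
#!/bin/bash
# Assumption: 16 cores per node (8 x 2, from the hardware table).
CORES_PER_NODE=16

# core_hours NODES WALLTIME   (WALLTIME given as HH:MM:SS)
# Prints nodes x cores-per-node x hours, with the walltime rounded up
# to whole hours. awk treats fields such as "08" as decimal numbers.
core_hours() {
    echo "$2" | awk -F: -v n="$1" -v c="$CORES_PER_NODE" '{
        seconds = $1 * 3600 + $2 * 60 + $3
        hours = int((seconds + 3599) / 3600)
        print n * c * hours
    }'
}

core_hours 2 12:00:00
```

The result can be compared with the "Available" column of sbalance; the value reported by sestimate itself is authoritative.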

scontrol show job JOBID

sacct -l -j JOBID

smemory JOBID

sdisk JOBID

Resources / AssociationResourceLimit - Waiting for a resource
AssociationJobLimit / QOSJobLimit - Not enough CPU time left, or the maximum number of CPUs is already reserved
Priority - Waiting due to low priority
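The reasons listed above can be looked up for your own pending jobs. This is a sketch that assumes the standard squeue client is available on the login node; the function names explain_reason and pending_reasons are hypothetical, and the explanations only paraphrase the list above.

```shell
#!/bin/bash
# Map a SLURM pending reason to the short explanation used in this guide.
explain_reason() {
    case "$1" in
        Resources|AssociationResourceLimit)
            echo "waiting for a resource" ;;
        AssociationJobLimit|QOSJobLimit)
            echo "not enough CPU time, or CPU limit reached" ;;
        Priority)
            echo "waiting due to low priority" ;;
        *)
            echo "not listed in this guide" ;;
    esac
}

# Print job ID and reason for every pending job of the current user.
# squeue: -h no header, -t state filter, -o format (%i job ID, %r reason).
pending_reasons() {
    squeue -h -u "$USER" -t PENDING -o "%i %r" |
    while read -r jobid reason; do
        echo "$jobid: $reason ($(explain_reason "$reason"))"
    done
}

# Only query the scheduler where squeue actually exists.
if command -v squeue >/dev/null 2>&1; then
    pending_reasons
fi
```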


slicenses

sreservations

susage

sreport -t Hours Cluster AccountUtilizationByUser Accounts=ACCOUNT Start=2015-01-01

#!/bin/bash
#SBATCH -A ACCOUNT
#SBATCH --job-name=NAME
#SBATCH --time=TIME

#SBATCH --gres=gpu:N
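Inside a job it is worth checking that the requested number of GPUs was actually granted. The sketch below assumes that SLURM exports CUDA_VISIBLE_DEVICES as a comma-separated list for jobs submitted with --gres=gpu:N, which is common but depends on the site configuration; gpu_count is a hypothetical helper.

```shell
#!/bin/bash
# gpu_count LIST
# Count the entries of a comma-separated device list such as "0,1,2".
# An empty list means that no GPU is visible.
gpu_count() {
    if [ -z "$1" ]; then
        echo 0
    else
        echo "$1" | awk -F, '{ print NF }'
    fi
}

# Assumption: CUDA_VISIBLE_DEVICES is set by the scheduler inside the job.
echo "GPUs visible to this job: $(gpu_count "${CUDA_VISIBLE_DEVICES:-}")"
```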

srun -l -n 1 -t TIME --gres=gpu:1 -A ACCOUNT APP

sbatch slurm.sh

Submitted batch job JOBID

scancel JOBID

#SBATCH --no-requeue

#SBATCH --partition=prod-gpu-k40

#SBATCH --qos=fast

#SBATCH --qos=lowpri

#SBATCH --mem-per-cpu=MEMORY

#SBATCH --mail-type=ALL
#SBATCH --mail-user=EMAIL

#!/bin/bash
#SBATCH -A ACCOUNT
#SBATCH --job-name=array
#SBATCH --time=24:00:00
#SBATCH --array=1-96
srun envtest.sh
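The array job above runs envtest.sh once per array index, but the script itself is not shown. A hypothetical version could look like the sketch below. It relies on SLURM_ARRAY_TASK_ID, which SLURM sets to the index of the current array task (1 to 96 here); the input file naming scheme and the task_input helper are assumptions made for illustration.

```shell
#!/bin/bash
# envtest.sh (hypothetical contents)

# Map an array index to a zero-padded input file name, e.g. 7 -> input_007.dat
task_input() {
    printf 'input_%03d.dat\n' "$1"
}

# Fall back to index 1 so the script can also be tried outside SLURM.
id=${SLURM_ARRAY_TASK_ID:-1}
echo "task $id on $(uname -n) would process $(task_input "$id")"
```

Each of the 96 tasks then works on its own input without any coordination between tasks.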

#!/bin/bash
#SBATCH -A ACCOUNT
#SBATCH --job-name=mpi
#SBATCH -N 2
#SBATCH --ntasks-per-node=8
#SBATCH --time=12:00:00
mpirun --report-pid ${TMPDIR}/mpirun.pid PROGRAM

#!/bin/bash
#SBATCH -A foobar
#SBATCH --job-name=omp
#SBATCH --time=06:00:00
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=8
OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK ./a.out

#!/bin/bash
#SBATCH -A foobar
#SBATCH --job-name=mpiomp
#SBATCH --time=08:00:00
#SBATCH -N 2
#SBATCH --ntasks=2
#SBATCH --ntasks-per-node=1
#SBATCH --cpus-per-task=8
#SBATCH -o slurm.out
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK
mpirun ./a.out

#!/bin/bash
#SBATCH -A foobar
#SBATCH --job-name=maple
#SBATCH -N 1
#SBATCH --ntasks-per-node=16
#SBATCH --time=06:00:00
#SBATCH -o slurm.out
#SBATCH --licenses=maplegrid:1

module load maple

${MAPLE}/toolbox/Grid/bin/startserver
${MAPLE}/toolbox/Grid/bin/joblauncher ${MAPLE}/toolbox/Grid/samples/Simple.mpl

Cluster	Debrecen2 (Leo)
Type	HP SL250s
Core / node	8 × 2 Xeon E5-2650v2 2.60 GHz
GPU / node	68 × 3 Nvidia K20x + 16 × 3 Nvidia K40x
# of compute nodes	84
Max Walltime	7-00:00:00
Max core / project	336
Max mem / core	7000 MB
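The figures in the table can be combined into cluster-wide totals. The arithmetic below is a sketch that reads "8 × 2" as 16 cores per node and "68 × 3 + 16 × 3" as 68 nodes with three K20x cards plus 16 nodes with three K40x cards; this reading is an assumption, although it agrees with the node count of 84.

```shell
#!/bin/bash
# Values taken from the hardware table.
NODES=84
CORES_PER_NODE=$(( 8 * 2 ))
K20X_NODES=68
K40X_NODES=16
GPUS_PER_NODE=3
MAX_CORES_PER_PROJECT=336
MAX_MEM_PER_CORE_MB=7000

total_cores=$(( NODES * CORES_PER_NODE ))
total_gpus=$(( (K20X_NODES + K40X_NODES) * GPUS_PER_NODE ))
# Number of full nodes a single project can occupy at once.
max_nodes_per_project=$(( MAX_CORES_PER_PROJECT / CORES_PER_NODE ))
# Upper bound on memory a job can request on one node via --mem-per-cpu.
max_mem_per_node_mb=$(( CORES_PER_NODE * MAX_MEM_PER_CORE_MB ))

echo "cores: $total_cores, GPUs: $total_gpus"
echo "full nodes per project: $max_nodes_per_project"
echo "memory per node at the per-core limit: $max_mem_per_node_mb MB"
```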


Contents

Requesting CPU time

Login

Copying files with SCP

Data synchronization

User interface

Module environment

Data sharing for project members

Using a shared home directory

Compiling applications

Using the SLURM scheduler

Estimating CPU time

Status information

SLURM warnings

Checking licenses

Checking maintenance

Aggregate consumption

Total consumption

Submitting jobs

Mandatory parameters

Reservation of GPUs

Interactive use

Submitting batch jobs

Non-restarting jobs

Partitions

Quality of Service (QoS)

High priority

Low priority

Memory allocation

Email notification

Array jobs

OpenMPI jobs

OpenMP (OMP) jobs

Hybrid MPI-OMP jobs

Maple Grid jobs
