This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
calcul:cluster:files_d_attente [2014/05/09 14:52] stephane caminade |
calcul:cluster:files_d_attente [2016/02/22 16:54] (current) stephane caminade [Exemple] |
||
---|---|---|---|
Line 3: | Line 3: | ||
===== Coprocesseurs ===== | ===== Coprocesseurs ===== | ||
^ SolarOrbiter | 1 R720+2K20 | | ^ SolarOrbiter | 1 R720+2K20 | | ||
- | ^ K20 | 2 R270+2K20 | | + | ^ K20 | 2 R270+2K20 et 1 R730+K20| |
^ PHI | 2 R720+2PHI | | ^ PHI | 2 R720+2PHI | | ||
- | ^ ERM_COPRO | 1 R720+2K20 et 2 R720+2PHI | | + | ^ ERM_COPRO | 1 R720+2K20 et 1 R730+K20 et 2 R720+2PHI | |
- | ^ **COPRO** | **2 R720+2K20 et 2 R720+2PHI **| | + | ^ **COPRO** | **2 R720+2K20 et 1 R730+K20 et 2 R720+2PHI **| |
Line 13: | Line 13: | ||
^ MIC | 1 R820 | | ^ MIC | 1 R820 | | ||
^ HERSCHEL | 1 R820 | | ^ HERSCHEL | 1 R820 | | ||
- | ^ **CPU_ONLY** | **4 R820 et 2 R720** | | + | ^ FLARECAST | 1 R730 | |
+ | ^ **CPU_ONLY** | **4 R820, 2 R720, 1 R730** | | ||
===== Ensemble des machines ===== | ===== Ensemble des machines ===== | ||
^ **ALL** | **Tous les noeuds** | | ^ **ALL** | **Tous les noeuds** | | ||
+ | |||
+ | ===== Priorités ===== | ||
+ | Plus la valeur est élevée, plus la file d'attente est prioritaire. | ||
+ | |||
+ | <color red>**Un job dans une file d'attente prioritaire suspendra l'exécution d'un job alloué dans une file de priorité inférieure.**</color> | ||
+ | |||
+ | ^ File d'attente ^ Priorité | | ||
+ | ^ ALL | 1 | | ||
+ | ^ CPU_ONLY | 1 | | ||
+ | ^ COPRO | 50 | | ||
+ | ^ K20 | 50 | | ||
+ | ^ PHI | 50 | | ||
+ | ^ ERM_COPRO | 100 | | ||
+ | ^ FLARECAST | 100 | | ||
+ | ^ HERSCHEL | 100 | | ||
+ | ^ MIC | 100 | | ||
+ | ^ MISTIC | 100 | | ||
+ | ^ SOLARORBITER | 100 | | ||
+ | |||
+ | ==== Exemple ==== | ||
+ | Si on considère une liste de jobs alloués dans la file de plus basse priorité, ALL : | ||
+ | <code> | ||
+ | root@cluster-head:~# squeue -l | ||
+ | Mon Feb 22 15:06:28 2016 | ||
+ | JOBID PARTITION NAME USER STATE TIME TIMELIMIT NODES NODELIST(REASON) | ||
+ | 2115 K20 bash scaminad RUNNING 54:49 UNLIMITED 1 cluster-r720-k20-1 | ||
+ | 2131 all MPI IDL abeelen RUNNING 0:59 UNLIMITED 1 cluster-r820-2 | ||
+ | 2130 all MPI IDL abeelen RUNNING 1:27 UNLIMITED 1 cluster-r820-1 | ||
+ | 2128 all MPI IDL abeelen RUNNING 5:30 UNLIMITED 1 cluster-r730-k20-1 | ||
+ | 2125 all MPI IDL abeelen RUNNING 23:07 UNLIMITED 1 cluster-r720-phi-2 | ||
+ | 2124 all MPI IDL abeelen RUNNING 24:14 UNLIMITED 1 cluster-r720-phi-1 | ||
+ | 2123 all MPI IDL abeelen RUNNING 23:05 UNLIMITED 1 cluster-r720-k20-2 | ||
+ | 2122 all MPI IDL abeelen RUNNING 26:42 UNLIMITED 1 cluster-r720-2 | ||
+ | 2121 all MPI IDL abeelen RUNNING 27:28 UNLIMITED 1 cluster-r720-1 | ||
+ | </code> | ||
+ | Et qu'on fait une allocation dans une file de priorité supérieure à celle de ALL : | ||
+ | <code> | ||
+ | root@cluster-head:~# salloc -N 2 -p K20 --exclusive | ||
+ | salloc: Granted job allocation 2132 | ||
+ | root@cluster-head:~# squeue -l | ||
+ | Mon Feb 22 15:06:40 2016 | ||
+ | JOBID PARTITION NAME USER STATE TIME TIMELIMIT NODES NODELIST(REASON) | ||
+ | 2132 K20 bash root RUNNING 0:04 UNLIMITED 2 cluster-r720-k20-2,cluster-r730-k20-1 | ||
+ | 2115 K20 bash scaminad RUNNING 55:01 UNLIMITED 1 cluster-r720-k20-1 | ||
+ | 2131 all MPI IDL abeelen RUNNING 1:11 UNLIMITED 1 cluster-r820-2 | ||
+ | 2130 all MPI IDL abeelen RUNNING 1:39 UNLIMITED 1 cluster-r820-1 | ||
+ | 2125 all MPI IDL abeelen RUNNING 23:19 UNLIMITED 1 cluster-r720-phi-2 | ||
+ | 2124 all MPI IDL abeelen RUNNING 24:26 UNLIMITED 1 cluster-r720-phi-1 | ||
+ | 2122 all MPI IDL abeelen RUNNING 26:54 UNLIMITED 1 cluster-r720-2 | ||
+ | 2121 all MPI IDL abeelen RUNNING 27:40 UNLIMITED 1 cluster-r720-1 | ||
+ | 2128 all MPI IDL abeelen SUSPENDE 5:38 UNLIMITED 1 cluster-r730-k20-1 | ||
+ | 2123 all MPI IDL abeelen SUSPENDE 23:13 UNLIMITED 1 cluster-r720-k20-2 | ||
+ | </code> | ||
+ | Les 2 jobs qui tournaient sur des machines de la file K20 se retrouvent suspendus. |