Why is my job not running
Revision as of 19:37, 23 June 2022 by Cabreraruizdiazj (talk | contribs)
According to SLURM documentation, when a job cannot be started a reason is immediately found and recorded in the job's "reason" field in the squeue output and the scheduler moves on to the next job to consider.
Related article: Account and QOS limits under SLURM
Common reasons why jobs are pending:
- Priority
- Resources being reserved for higher priority job. This is particularly common on Burst QOS jobs. Refer to the Choosing QOS for a Job page for details.
- Resources
- Required resources are in use
- Dependency
- Job dependencies not yet satisfied
- Reservation
- Waiting for advanced reservation
- AssociationJobLimit
- User or account job limit reached
- AssociationResourceLimit
- User or account resource limit reached
- AssociationTimeLimit
- User or account time limit reached
- QOSJobLimit
- Quality Of Service (QOS) job limit reached
- QOSResourceLimit
- Quality Of Service (QOS) resource limit reached
- QOSTimeLimit
- Quality Of Service (QOS) time limit reached