| .TH SLURM "1" "March 2009" "slurm 2.0" "Slurm system" |
| |
| .SH "NAME" |
| slurm \- SLURM system overview. |
| |
| .SH "DESCRIPTION" |
| The Simple Linux Utility for Resource Management (SLURM) is an open source, |
| fault-tolerant, and highly scalable cluster management and job scheduling system |
| for large and small Linux clusters. SLURM requires no kernel modifications for |
| its operation and is relatively self-contained. As a cluster resource manager, |
| SLURM has three key functions. First, it allocates exclusive and/or non-exclusive |
| access to resources (compute nodes) to users for some duration of time so they |
| can perform work. Second, it provides a framework for starting, executing, and |
| monitoring work (normally a parallel job) on the set of allocated nodes. |
| Finally, it arbitrates contention for resources by managing a queue of |
| pending work. |
| Optional plugins can be used for accounting, advanced reservation, |
| gang scheduling (time sharing for parallel jobs), backfill scheduling, |
| resource limits by user or bank account, |
| and sophisticated multifactor job prioritization algorithms. |
| |
| SLURM has a centralized manager, \fBslurmctld\fR, to monitor resources and |
| work. There may also be a backup manager to assume those responsibilities in the |
| event of failure. Each compute server (node) has a \fBslurmd\fR daemon, which |
| can be compared to a remote shell: it waits for work, executes that work, returns |
| status, and waits for more work. An optional \fBslurmdbd\fR (SLURM DataBase Daemon) |
| can be used for accounting purposes and to maintain resource limit information. |
| |
| Basic user tools include \fBsrun\fR to initiate jobs, |
| \fBscancel\fR to terminate queued or running jobs, \fBsinfo\fR to report system |
| status, and \fBsqueue\fR to report the status of jobs. The administrative |
| tool \fBscontrol\fR can be used to monitor and modify configuration and state |
| information. APIs are available for all functions. |
| |
| SLURM configuration is maintained in the \fBslurm.conf\fR file. |
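| |
| As a minimal sketch, and not a complete configuration, a \fBslurm.conf\fR |
| fragment that names the manager, a backup manager, the compute nodes, and one |
| partition might look like the following. The host names (mgr0, mgr1, linux[1\-32]) |
| and the partition name (debug) are hypothetical, and the parameters shown are |
| assumed to match the installed version; see \fBslurm.conf\fR(5) for the |
| authoritative list. |
| .nf |
| # Hypothetical host names; adjust to the local cluster. |
| ControlMachine=mgr0 |
| BackupController=mgr1 |
| # Compute nodes, each running a slurmd daemon. |
| NodeName=linux[1\-32] State=UNKNOWN |
| # A single default partition containing all of the nodes. |
| PartitionName=debug Nodes=linux[1\-32] Default=YES MaxTime=INFINITE State=UP |
| .fi |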
| |
| Man pages are available for all SLURM commands, daemons, and APIs, as well as the |
| \fBslurm.conf\fR file. |
| Extensive documentation is also available on the Internet at |
| \fB<http://slurm.schedmd.com/>\fR. |
| |
| .SH "COPYING" |
| Copyright (C) 2005\-2007 The Regents of the University of California. |
| Copyright (C) 2008\-2009 Lawrence Livermore National Security. |
| Produced at Lawrence Livermore National Laboratory (cf, DISCLAIMER). |
| CODE\-OCEC\-09\-009. All rights reserved. |
| .LP |
| This file is part of SLURM, a resource management program. |
| For details, see <http://slurm.schedmd.com/>. |
| .LP |
| SLURM is free software; you can redistribute it and/or modify it under |
| the terms of the GNU General Public License as published by the Free |
| Software Foundation; either version 2 of the License, or (at your option) |
| any later version. |
| .LP |
| SLURM is distributed in the hope that it will be useful, but WITHOUT ANY |
| WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS |
| FOR A PARTICULAR PURPOSE. See the GNU General Public License for more |
| details. |
| |
| .SH "SEE ALSO" |
| \fBsacct\fR(1), \fBsacctmgr\fR(1), \fBsalloc\fR(1), \fBsattach\fR(1), |
| \fBsbatch\fR(1), \fBsbcast\fR(1), \fBscancel\fR(1), \fBscontrol\fR(1), |
| \fBsinfo\fR(1), \fBsmap\fR(1), \fBsqueue\fR(1), \fBsreport\fR(1), |
| \fBsrun\fR(1), \fBsshare\fR(1), \fBsstat\fR(1), \fBstrigger\fR(1), |
| \fBsview\fR(1), |
| \fBbluegene.conf\fR(5), \fBslurm.conf\fR(5), \fBslurmdbd.conf\fR(5), |
| \fBwiki.conf\fR(5), |
| \fBslurmctld\fR(8), \fBslurmd\fR(8), \fBslurmdbd\fR(8), \fBslurmstepd\fR(8), |
| \fBspank\fR(8) |
| |