When to use Slurm
- You run batch jobs or distributed training and want a queue and scheduler.
- Your team shares a pool of GPUs and needs fair scheduling across jobs.
- You already have
sbatchjob scripts.
Prefer containers and orchestration? Use Kubernetes access instead. Want a single host you SSH into? See VM or Bare Metal.
Prerequisites
- The bid is promoted to a confirmed reservation that is visible in your portfolio.
- Checkout and payment for the reservation are complete.
- Your account has at least one active SSH public key registered — Slurm logins use your account keys. See Manage SSH keys.
Launch the cluster
Open Clusters
From the console, go to Clusters (
/clusters) and start a new cluster, or open your reservation in Portfolio and choose the Slurm access mode.Choose Slurm and a network mode
Pick Slurm, then a network mode:
- Public — the SSH login is reachable over the internet.
- Private — the SSH login is reachable only over your reservation’s WireGuard VPN.
Connect over SSH
When the scheduler is ready, the cluster panel shows your SSH login host and port, a ready-to-run Connect command, and the login host key so you can verify it on first connect. Copy those values and connect:Submit a job
On the login node, inspect the cluster and claim GPUs with the standard Slurm tools:sbatch script:
Private clusters: bring up WireGuard first
Private clusters expose nothing publicly — the SSH login is reachable only over your reservation’s WireGuard tunnel. Set it up once:Tear down
Tearing down stops the scheduler, revokes SSH login access, drains the workers, and returns the GPU nodes to your reservation. A torn-down or failed cluster relaunches in place.Troubleshooting
Permission denied (publickey)
Permission denied (publickey)
Slurm logins use your account SSH keys. Confirm the private key your SSH client uses matches a key registered on your account, and that the cluster has finished issuing login access (the panel shows the SSH login once it’s ready).
No SSH login details yet
No SSH login details yet
The scheduler starts after every node reports GPU-ready. Wait for the manage page to show the SSH login host and port; the first launch is paced by the GPU image pull.
Host key changed warning
Host key changed warning
If you reconnect after a relaunch, remove the old entry and reconnect:
ssh-keygen -R <login-host>. Verify the new key against the host key shown on the cluster panel.A private cluster won't connect
A private cluster won't connect
Make sure the WireGuard tunnel is up (
sudo wg show) before connecting. Re-generate the peer config from the Connect section if your public key changed.What’s next

Kubernetes access
Prefer container orchestration? Run a managed Kubernetes cluster on your reserved GPUs instead.

Manage SSH keys
Register the account SSH keys your Slurm logins use.

