Differences
This shows you the differences between two versions of the page.
dragnet:cluster_usage [2017-08-17 22:01] – encode email addresses to reduce spam (this is a publicly visible page) amesfoort | dragnet:cluster_usage [2019-01-07 15:06] (current) – [Access and Login] Reinoud Bokhorst | ||
---|---|---|---|
Line 8: | Line 8: | ||
===== Access and Login ===== | ===== Access and Login ===== | ||
To get an account, get permission from the Dragnet PI: Jason Hessels ('' | To get an account, get permission from the Dragnet PI: Jason Hessels ('' | ||
- | Easiest is to ask him to send his permission to Teun Grit ('' | + | Easiest is to ask him to send his permission to the RO Sysadmins |
- | You can also provide | + | You can also provide |
Having an account, ssh to hostname '' | Having an account, ssh to hostname '' | ||
Line 332: | Line 332: | ||
$ scontrol resume 100 | $ scontrol resume 100 | ||
$ scontrol resume [1000,2000] | $ scontrol resume [1000,2000] | ||
+ | | ||
+ | ==== SLURM Troubleshooting ==== | ||
+ | == " | ||
+ | |||
+ | If you expect that there should be enough resources, but slurm submission fails because some nodes could be in " | ||
+ | something like this, where nodes drg06 and drg08 are in drain state: | ||
+ | |||
+ | $ sinfo | ||
+ | PARTITION AVAIL TIMELIMIT | ||
+ | workers* | ||
+ | workers* | ||
+ | workers* | ||
+ | head | ||
+ | |||
+ | To " | ||
+ | $ scontrol update NodeName=drg08 State=DOWN Reason=" | ||
+ | $ scontrol update NodeName=drg08 State=RESUME | ||
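
The drain check described above could also be scripted. The following is a minimal sketch, not part of the original page: `drained_nodes` is a hypothetical helper name, and it assumes node-oriented `sinfo` output in the form produced by `sinfo -N -h -o "%N %T"` (one node name and one state per line). It prints each node whose state starts with `drain`; those nodes could then be resumed with the `scontrol update ... State=RESUME` command shown above.

```shell
#!/bin/sh
# Hypothetical helper (an assumption, not from the wiki page):
# read "node state" pairs on stdin and print the names of nodes whose
# state begins with "drain" (e.g. drained, draining, drained*).
# sort -u removes duplicates, since a node that belongs to several
# partitions is listed once per partition in node-oriented output.
drained_nodes() {
    awk '$2 ~ /^drain/ { print $1 }' | sort -u
}

# Intended use on a host where the SLURM client tools are installed:
#   sinfo -N -h -o "%N %T" | drained_nodes
# Then, for each node name reported:
#   scontrol update NodeName=<node> State=RESUME
```

The parsing is kept in a function that reads standard input so it can be tried on saved `sinfo` output without touching the cluster; the `scontrol` step is deliberately left manual, since resuming a node changes cluster state.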