public:slurm_upgrade_to_v17.02.2

This is an old revision of the document!


Checklist slurm upgrade

  1. stop slurm accross the cluster (slurmctld, slurmdbd, slurmd)
  2. dump mysql database 'slurmaccounting_cep4' at lcs032 (see lcs032:/usr/local/bin/mysql_backup.sh)
  3. dump slurm accounts and reservations on head01:
    scontrol show res -o > /root/slurm/reservations.`date +\%a`.txt
    sacctmgr dump lofar_cep3 File=/root/slurm/sacctmgr.`date +\%a`.dump
    sacctmgr show qos > /root/slurm/qos.`date +\%a`.dump
      - create backup of /data/home/slurm
      - yum install slurm
      - start slurm on head node
      - check accounts / users: sacctmgr
      - check cobalt reservation
      - start nodes. Check sinfo -R.

In case accounts need to be recreated:

sacctmgr create qos inspectionplots Priority=10 Preempt=normal
sacctmgr create qos permanent Priority=20]
sacctmgr load sacctmgr.Thu.dump
  • Last modified: 2019-01-31 14:47
  • by Reinoud Bokhorst