public:stopdayactivities_5jun2018

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
public:stopdayactivities_5jun2018 [2018-06-05 08:05] – [LCU] Reinoud Bokhorstpublic:stopdayactivities_5jun2018 [2018-06-05 11:52] (current) – [Stop-day activities June 5-6, 2018] Reinoud Bokhorst
Line 5: Line 5:
 ^ Coordinator | Reinoud Bokhorst | roadmin@astron.nl | ^ Coordinator | Reinoud Bokhorst | roadmin@astron.nl |
 ^ Software Support | Arno Schoenmakers | softwaresupport@astron.nl | ^ Software Support | Arno Schoenmakers | softwaresupport@astron.nl |
-^ Science, Operations and Support | Matthijs van der Wiel | sos@astron.nl |+^ Science, Operations and Support | Pietro Zucca | sos@astron.nl |
 ^ Observer | Henk Mulder | observer@astron.nl | ^ Observer | Henk Mulder | observer@astron.nl |
  
Line 18: Line 18:
 ==== Cobalt ==== ==== Cobalt ====
  
-  * Reboots and idrac reboots. (Hopko)+  * ✔ Reboots and idrac reboots. (Hopko)
  
  
 ==== CEP3 ==== ==== CEP3 ====
  
-  * Block access at 08:00 (Teun) +  * ✔ Block access at 08:00 (Teun) 
-  * All nodes: file system check and reboot. (Kees)+  * ✔ All nodes: file system check and reboot. (Kees)
  
  
 ==== CEP4 ==== ==== CEP4 ====
  
-  * Reboot (Hopko) +  * ✔ Reboot (Hopko) 
-  * Recreate Docker thinpools on CPU nodes +  * ✔ Recreate Docker thinpools on CPU nodes 
-  * Recabling of Infiniband, details in Jira ticket+  * ✔ Recabling of Infiniband, details in Jira ticket
   * Performance tests after recabling   * Performance tests after recabling
  
 ==== LEXARS ==== ==== LEXARS ====
  
-  *  Reboot (Hopko/Robin)+  *  ✔ Reboot (Hopko/Robin)
  
  
Line 45: Line 45:
 ==== Central Services ==== ==== Central Services ====
    
-  * Restart qpidd@ccu001 (ref. https://support.astron.nl/jira/browse/ROADMT-99) +  * ✔ Restart qpidd@ccu001 (ref. https://support.astron.nl/jira/browse/ROADMT-99) 
-  * Test DMZ KVM Failover  (DMZ KVM Hypervisor hosts DMZ services (portal,dns server,smtp,proxy etc)) +  * ✔ Test DMZ KVM Failover  (DMZ KVM Hypervisor hosts DMZ services (portal,dns server,smtp,proxy etc)) 
-  * OS upgrade and reboot+  * ✔ OS upgrade and reboot
 ==== LTA ==== ==== LTA ====
  
-  * Update and reboot (Reinoud) +  * ✔ Update and reboot (Reinoud) 
-  * Migration of Oracle DB to new hardware (Andrey Tsyganov)+  * ✔ Migration of Oracle DB to new hardware (Andrey Tsyganov)
 ==== Aartfaac ==== ==== Aartfaac ====
  
-  * Check for broken disks **Fail**: ais007 had a degraded RAID1, but a controller firmware update helped.+  * ✔ Check for broken disks **Fail**: ais007 had a degraded RAID1, but a controller firmware update helped.
  
  
Line 60: Line 60:
 ==== Core switches ==== ==== Core switches ====
  
-  * Warm reset PD0, RD0 and RD1 (Arjen)+  * ✔ Warm reset PD0, RD0 and RD1 (Arjen)
  
  
Line 86: Line 86:
  
   * Rollout Docker images   * Rollout Docker images
-  * SLURM upgrade+  * ✘ SLURM upgrade  (postponed)
 ==== Aartfaac ==== ==== Aartfaac ====
  
  • Last modified: 2018-06-05 08:05
  • by Reinoud Bokhorst