Differences
This shows you the differences between two versions of the page.
| public:stopdayactivities_5jun2018 [2018-06-04 14:02] – [CEP4] Reinoud Bokhorst | public:stopdayactivities_5jun2018 [2018-06-05 11:52] (current) – [Stop-day activities June 5-6, 2018] Reinoud Bokhorst | ||
|---|---|---|---|
| Line 5: | Line 5: | ||
| ^ Coordinator | Reinoud Bokhorst | roadmin@astron.nl | | ^ Coordinator | Reinoud Bokhorst | roadmin@astron.nl | | ||
| ^ Software Support | Arno Schoenmakers | softwaresupport@astron.nl | | ^ Software Support | Arno Schoenmakers | softwaresupport@astron.nl | | ||
| - | ^ Science, Operations and Support | Matthijs van der Wiel | sos@astron.nl | | + | ^ Science, Operations and Support | Pietro Zucca | sos@astron.nl | |
| ^ Observer | Henk Mulder | observer@astron.nl | | ^ Observer | Henk Mulder | observer@astron.nl | | ||
| Line 18: | Line 18: | ||
| ==== Cobalt ==== | ==== Cobalt ==== | ||
| - | * Reboots and idrac reboots. (Hopko/Robin) | + | * ✔ Reboots and idrac reboots. (Hopko) |
| ==== CEP3 ==== | ==== CEP3 ==== | ||
| - | * Block access at 08:00 (Teun) | + | * ✔ Block access at 08:00 (Teun) |
| - | * All nodes: file system check and reboot. (Kees) | + | * ✔ All nodes: file system check and reboot. (Kees) |
| ==== CEP4 ==== | ==== CEP4 ==== | ||
| - | * Reboot (Hopko) | + | * ✔ Reboot (Hopko) |
| - | * Recreate Docker thinpools on CPU nodes | + | * ✔ Recreate Docker thinpools on CPU nodes |
| - | * Recabling of Infiniband, details in Jira ticket | + | * ✔ Recabling of Infiniband, details in Jira ticket |
| * Performance tests after recabling | * Performance tests after recabling | ||
| ==== LEXARS ==== | ==== LEXARS ==== | ||
| - | * Reboot (Hopko/ | + | * |
| Line 45: | Line 45: | ||
| ==== Central Services ==== | ==== Central Services ==== | ||
| - | * Restart qpidd@ccu001 (ref. https:// | + | * ✔ Restart qpidd@ccu001 (ref. https:// |
| - | * Test DMZ KVM Failover | + | * ✔ Test DMZ KVM Failover |
| - | * OS upgrade and reboot | + | * ✔ OS upgrade and reboot |
| ==== LTA ==== | ==== LTA ==== | ||
| - | * Update and reboot (Reinoud) | + | * ✔ Update and reboot (Reinoud) |
| - | * Migration of Oracle DB to new hardware (Andrey Tsyganov) | + | * ✔ Migration of Oracle DB to new hardware (Andrey Tsyganov) |
| ==== Aartfaac ==== | ==== Aartfaac ==== | ||
| - | * Check for broken disks **Fail**: ais007 had a degraded RAID1, but a controller firmware update helped. | + | * ✔ Check for broken disks **Fail**: ais007 had a degraded RAID1, but a controller firmware update helped. |
| Line 60: | Line 60: | ||
| ==== Core switches ==== | ==== Core switches ==== | ||
| - | * Warm reset PD0, RD0 and RD1 (Arjen) | + | * ✔ Warm reset PD0, RD0 and RD1 (Arjen) |
| Line 81: | Line 81: | ||
| * synchronize Python packages, see list in ticket | * synchronize Python packages, see list in ticket | ||
| + | * ✔ umask change for foreign stations | ||
| ==== CEP4 ==== | ==== CEP4 ==== | ||
| * Rollout Docker images | * Rollout Docker images | ||
| - | * SLURM upgrade | + | * ✘ SLURM upgrade |
| ==== Aartfaac ==== | ==== Aartfaac ==== | ||