Differences
This shows you the differences between two versions of the page.
public:stopdayactivities_5jun2018 [2018-06-05 08:05] – [LCU] Reinoud Bokhorst | public:stopdayactivities_5jun2018 [2018-06-05 11:52] (current) – [Stop-day activities June 5-6, 2018] Reinoud Bokhorst | ||
---|---|---|---|
Line 5: | Line 5: | ||
^ Coordinator | Reinoud Bokhorst | roadmin@astron.nl | | ^ Coordinator | Reinoud Bokhorst | roadmin@astron.nl | | ||
^ Software Support | Arno Schoenmakers | softwaresupport@astron.nl | | ^ Software Support | Arno Schoenmakers | softwaresupport@astron.nl | | ||
- | ^ Science, Operations and Support | Matthijs van der Wiel | sos@astron.nl | | + | ^ Science, Operations and Support | Pietro Zucca | sos@astron.nl | |
^ Observer | Henk Mulder | observer@astron.nl | | ^ Observer | Henk Mulder | observer@astron.nl | | ||
Line 18: | Line 18: | ||
==== Cobalt ==== | ==== Cobalt ==== | ||
- | * Reboots and idrac reboots. (Hopko) | + | * ✔ Reboots and idrac reboots. (Hopko) |
==== CEP3 ==== | ==== CEP3 ==== | ||
- | * Block access at 08:00 (Teun) | + | * ✔ Block access at 08:00 (Teun) |
- | * All nodes: file system check and reboot. (Kees) | + | * ✔ All nodes: file system check and reboot. (Kees) |
==== CEP4 ==== | ==== CEP4 ==== | ||
- | * Reboot (Hopko) | + | * ✔ Reboot (Hopko) |
- | * Recreate Docker thinpools on CPU nodes | + | * ✔ Recreate Docker thinpools on CPU nodes |
- | * Recabling of Infiniband, details in Jira ticket | + | * ✔ Recabling of Infiniband, details in Jira ticket |
* Performance tests after recabling | * Performance tests after recabling | ||
==== LEXARS ==== | ==== LEXARS ==== | ||
- | * Reboot (Hopko/ | + | * |
Line 45: | Line 45: | ||
==== Central Services ==== | ==== Central Services ==== | ||
- | * Restart qpidd@ccu001 (ref. https:// | + | * ✔ Restart qpidd@ccu001 (ref. https:// |
- | * Test DMZ KVM Failover | + | * ✔ Test DMZ KVM Failover |
- | * OS upgrade and reboot | + | * ✔ OS upgrade and reboot |
==== LTA ==== | ==== LTA ==== | ||
- | * Update and reboot (Reinoud) | + | * ✔ Update and reboot (Reinoud) |
- | * Migration of Oracle DB to new hardware (Andrey Tsyganov) | + | * ✔ Migration of Oracle DB to new hardware (Andrey Tsyganov) |
==== Aartfaac ==== | ==== Aartfaac ==== | ||
- | * Check for broken disks **Fail**: ais007 had a degraded RAID1, but a controller firmware update helped. | + | * ✔ Check for broken disks **Fail**: ais007 had a degraded RAID1, but a controller firmware update helped. |
Line 60: | Line 60: | ||
==== Core switches ==== | ==== Core switches ==== | ||
- | * Warm reset PD0, RD0 and RD1 (Arjen) | + | * ✔ Warm reset PD0, RD0 and RD1 (Arjen) |
Line 86: | Line 86: | ||
* Rollout Docker images | * Rollout Docker images | ||
- | * SLURM upgrade | + | * ✘ SLURM upgrade |
==== Aartfaac ==== | ==== Aartfaac ==== | ||