Differences
This shows you the differences between two versions of the page.
public:stopdayactivities_10apr2018 [2018-04-06 06:45] – [CEP3] Arno Schoenmakers | public:stopdayactivities_10apr2018 [2018-04-11 10:14] (current) – [Core switches] grit | ||
---|---|---|---|
Line 4: | Line 4: | ||
^ Coordinator | Teun Grit | roadmin@astron.nl | | ^ Coordinator | Teun Grit | roadmin@astron.nl | | ||
- | ^ Software Support | Thomas Juerges + Jorrit | + | ^ Software Support | Thomas Juerges + Arno Schaap (2 days)| softwaresupport@astron.nl | |
^ Science, Operations and Support | Pietro Zucca | sos@astron.nl | | ^ Science, Operations and Support | Pietro Zucca | sos@astron.nl | | ||
^ Observer | Jur Sluman | observer@astron.nl | | ^ Observer | Jur Sluman | observer@astron.nl | | ||
Line 19: | Line 19: | ||
==== Cobalt ==== | ==== Cobalt ==== | ||
- | * Reboots and idrac reboots. (Hopko/ | + | * ✔ Reboots and idrac reboots. (Hopko/ |
- | * CBM010 will be present before the stopday | + | * ✔ CBM010 will be present before the stopday |
==== CEP3 ==== | ==== CEP3 ==== | ||
- | * Block access at 08:00 (Teun) | + | * ✔ Block access at 08:00 (Teun) |
- | * Mount DAC cables to headnodes to support floating IP (Hopko/ | + | * ✔ Mount DAC cables to headnodes to support floating IP (Hopko/ |
- | * All nodes: file system check and reboot. (Hopko, Robin) | + | * ✔ All nodes: file system check and reboot. (Hopko, Robin) |
- | * Check/debug persistence of Slurm reservations (Reinoud) | + | * ✔ Was broken: |
- | * NFS mounts for cep3, from all the lof-nodes are using control! | + | * ✔ NFS mounts for cep3, from all the lof-nodes are using control! |
==== CEP4 ==== | ==== CEP4 ==== | ||
- | * Connect DAC cable (Hopko) | + | * ✔ Connect DAC cable (Hopko) |
==== LEXARS ==== | ==== LEXARS ==== | ||
- | * lexar003 reboot using XCAT (148 days up) (Hopko/Robin) | + | * |
==== LCU ==== | ==== LCU ==== | ||
- | * Reboot of all LCU's | + | * |
==== Central Services ==== | ==== Central Services ==== | ||
- | * Update portals to CentOS/ | + | * ✔ Update portals to CentOS/ |
- | * Remove sas001 | + | * ✔ Update and reboot lcs020 .. lcs30 |
- | * Update & reboot NFS server lcs115 | + | * ✔ Remove sas001 |
- | * Almost all NFS mounts on the lcs115 NFS server go over the control network. Only lexar003, lexar004, and lhd002 correctly use the offline network. (Kees) | + | * ✔ Update & reboot NFS server lcs115 |
+ | * ✔ Almost all NFS mounts on the lcs115 NFS server go over the control network. Only lexar003, lexar004, and lhd002 correctly use the offline network. | ||
+ | * ✔ Mainly MAC/SAS, LCU's and Aartfaac machines use NFS over the control VLAN. They don't have an offline or online network connection. | ||
* Check resolv.conf settings; see https:// | * Check resolv.conf settings; see https:// | ||
==== LTA ==== | ==== LTA ==== | ||
- | * Update and reboot when required (Reinoud) | + | * ✔ Update and reboot when required (Reinoud) |
==== Aartfaac ==== | ==== Aartfaac ==== | ||
- | * Check for broken disks | + | * ✘ Check for broken disks **Fail**: ais007 has a degraded RAID1 !! |
+ | * **For review**: Add Jasmin & Reinoud to all nodes as admins | ||
==== Core switches ==== | ==== Core switches ==== | ||
Line 61: | Line 64: | ||
+ | ==== Communication issues ==== | ||
+ | **For review**: At the end of the first day, software support needs to report status to the coordinator | ||
===== Software updates ===== | ===== Software updates ===== | ||
Line 74: | Line 79: | ||
==== CEP3 ==== | ==== CEP3 ==== | ||
- | * Reboot / fs checks | + | * ✔ Reboot / fs checks |
- | * Make AOFlagger 2.10 the default version (already installed) | + | * ✔ Make AOFlagger 2.10 the default version (already installed) |
- | * Make WSClean 2.5 the default version (already installed) | + | * ✔ Make LOFAR-Release-3_0_14 the default version (linked against AOFlagger 2.10) |
+ | * ✔ Make WSClean 2.5 the default version (already installed) | ||
==== CEP4 ==== | ==== CEP4 ==== |