public:stopdayactivities_10apr2018

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
public:stopdayactivities_10apr2018 [2018-04-10 10:53] – [Central Services] gritpublic:stopdayactivities_10apr2018 [2018-04-11 10:14] (current) – [Core switches] grit
Line 24: Line 24:
  
   * ✔ Block access at 08:00 (Teun)   * ✔ Block access at 08:00 (Teun)
-  * Mount DAC cables to headnodes to support floating IP (Hopko/Robin)+  * ✔ Mount DAC cables to headnodes to support floating IP (Hopko/Robin)
   * ✔ All nodes: file system check and reboot. (Hopko, Robin)   * ✔ All nodes: file system check and reboot. (Hopko, Robin)
-  * Check/debug persistence of Slurm reservations (Reinoud) +  * ✔ Was broken: Check/debug persistence of Slurm reservations (Reinoud)  **For the review**: We should have a backup in place! 
-  * NFS mounts for cep3, from all the lof-nodes are using control!+  * ✔ NFS mounts for cep3, from all the lof-nodes are using control!
  
  
 ==== CEP4 ==== ==== CEP4 ====
  
-  * Connect DAC cable (Hopko)+  * ✔ Connect DAC cable (Hopko)
  
 ==== LEXARS ==== ==== LEXARS ====
  
-  *  lexar003 reboot using XCAT (148 days up) (Hopko/Robin)+  *  ✔ lexar003 reboot using XCAT (148 days up) (Was done by Reinoud 9-4-2018)
  
 ==== LCU ==== ==== LCU ====
  
-  *  ✔ Reboot of all Dutch LCU's (teun) (ILT stations in local mode)+  *  ✔ Reboot of all Dutch LCU's (teun) (ILT stations in local mode) **For the review**: We discovered that all LCU's have a fixed NFS mount on /home. This should be an automount.
  
 ==== Central Services ==== ==== Central Services ====
    
   * ✔ Update portals to CentOS/KVM   * ✔ Update portals to CentOS/KVM
-  *  +  * ✔ Update and reboot lcs020 .. lcs30 
-  * Remove sas001+  * ✔ Remove sas001 and sas099 (powered off, disconnect cables)
   * ✔ Update & reboot NFS server lcs115   * ✔ Update & reboot NFS server lcs115
-  * Almost all nfs mounts on the lcs115 nfs server are over the control network. Only a few correctly use the offline network: only lexar003, lexar004, and lhd002.(Kees) Mainly MAC/SAS, LCU's and Aartfaac machines use NFS over control VLAN. They don't have a off-line of on-line network connection.+  * ✔ Almost all nfs mounts on the lcs115 nfs server are over the control network. Only a few correctly use the offline network: only lexar003, lexar004, and lhd002. We should force CEP3 to use off-line! 
 +  * ✔ Mainly MAC/SAS, LCU's and Aartfaac machines use NFS over control VLAN. They don't have a off-line of on-line network connection.
   * Check resolv.conf settings; see https://support.astron.nl/lofar_issuetracker/issues/10448   * Check resolv.conf settings; see https://support.astron.nl/lofar_issuetracker/issues/10448
 ==== LTA ==== ==== LTA ====
  
-  * Update and reboot when required (Reinoud)+  * ✔ Update and reboot when required (Reinoud)
  
 ==== Aartfaac ==== ==== Aartfaac ====
  
-  * Check for broken disks+  * ✘ Check for broken disks **Fail**: ais007 has a degraded RAID1  !! 
 +  * **For review**: Add Jasmin & Reinoud to all nodes as admin
 ==== Core switches ==== ==== Core switches ====
  
Line 62: Line 64:
  
  
 +==== Communication issues ====
  
 +**For review**: At the end of the 1st day software support needs to report status to coordinator
 ===== Software updates ===== ===== Software updates =====
  
Line 75: Line 79:
 ==== CEP3 ==== ==== CEP3 ====
  
-  * Reboot / fs checks +  * ✔ Reboot / fs checks 
-  * Make AOFlagger 2.10 the default version (already installed) +  * ✔ Make AOFlagger 2.10 the default version (already installed) 
-  * Make LOFAR-Release-3_0_14 the default version (linked against AOFlagger 2.10) +  * ✔ Make LOFAR-Release-3_0_14 the default version (linked against AOFlagger 2.10) 
-  * Make WSClean 2.5 the default version (already installed)+  * ✔ Make WSClean 2.5 the default version (already installed)
  
 ==== CEP4 ==== ==== CEP4 ====
  • Last modified: 2018-04-10 10:53
  • by grit