public:stopdayactivities_2oct2018

Differences

This shows you the differences between two versions of the page.

public:stopdayactivities_2oct2018 [2018-08-29 12:53] – [Stop-day activities October 2-3, 2018] grit
public:stopdayactivities_2oct2018 [2018-10-02 14:54] – [CEP3] Jasmin Klipic
Line 4: Line 4:
  
 ^ Coordinator | Jasmin Klipic | roadmin@astron.nl | ^ Coordinator | Jasmin Klipic | roadmin@astron.nl |
-^ Software Support | | softwaresupport@astron.nl | +^ Software Support | Adriaan Renting | softwaresupport@astron.nl | 
-^ Science, Operations and Support | | sos@astron.nl | +^ Science, Operations and Support |Pietro Zucca  | sos@astron.nl | 
-^ Observer | | observer@astron.nl |+^ Observer | Henk Mulder | observer@astron.nl |
  
  
Line 19: Line 19:
 ==== Cobalt ==== ==== Cobalt ====
  
-  * Reboots and idrac reboots. (Hopko)+  * ✔ Reboots and idrac reboots. (Hopko)
  
  
 ==== CEP3 ==== ==== CEP3 ====
  
-  * Block access at 08:00 (Teun) +  * ✔ Block access at 08:00 (Teun) 
-  * All nodes: file system check and reboot. (Kees)+  * ✔ Replace/reseat 7 broken disks (Kees) 
 +  * ✔ All nodes: file system check and reboot. (Kees)
  
-After the reboot a problem appeared: the NIS netgroup "cep3" seemed to be missing 2 users. A workaround was implemented and the problem was solved the next day.+ 
  
  
Line 33: Line 34:
 ==== CEP4 ==== ==== CEP4 ====
  
-  * No reboot needed. Was done last week (Hopko)+  * Lustre upgrade (Hopko/Robin/Reinoud) 
 +  * <del>Increase /tmp to 4GB and tmpfs</del> 
 +  * CEP4 /tmp to tmpfs: will not be done, since it can have an impact on SLURM (to be discussed/planned for the next stop day)
  
  
 ==== LEXARS ==== ==== LEXARS ====
  
-  *  No action. Were rebooted last week. (Hopko/Robin) +  *  No action. (Needs to be checked; may be upgraded to 7.5?)
  
 ==== LCU ==== ==== LCU ====
  
-  * ?+  * ✔ No reboots
  
  
 ==== Portals ==== ==== Portals ====
-  * Update & reboot using Spacewalk+  * ✔ Update & reboot using Spacewalk
  
 ==== Central Services lcs020 .. lcs030 ==== ==== Central Services lcs020 .. lcs030 ====
    
-  * OS upgrade and reboot+  * ✔ OS upgrade and reboot 
 +  * ✔ lcs022/027/030/031 will be switched off (needs confirmation from Jens K. and Teun)
  
 ==== Other Central Services ==== ==== Other Central Services ====
    
-  * OS upgrade and reboot+  * ✔ OS upgrade and reboot
  
  
 ==== LTA ==== ==== LTA ====
  
-  * None+  * ✔ None
  
    
 ==== Aartfaac ==== ==== Aartfaac ====
  
-  * ?+  * ✔ None
  
  
Line 85: Line 88:
 ==== CEP3 ==== ==== CEP3 ====
  
-  * Activate disk quota+  * Activate disk quota 
 +  * Check that software environment is the same on all nodes (lof015 included)
  
 ==== LCU ==== ==== LCU ====
Line 93: Line 97:
 ==== CEP4 ==== ==== CEP4 ====
  
-  * Lustre update+  * Slurm update (cpu/gpu nodes tbd): not done; this should be verified first
  
 ==== Aartfaac ==== ==== Aartfaac ====
Line 108: Line 112:
 ===== In the field ===== ===== In the field =====
  
-  + 
 +==== DWG Lofar Systems ==== 
 + 
 +  * ✔ Update & reboot (if possible)
 ==== Review meeting ==== ==== Review meeting ====
  
 The minutes of the review meeting can be found [[public:stopday_review_Oct_2018|here]]. The minutes of the review meeting can be found [[public:stopday_review_Oct_2018|here]].
  • Last modified: 2018-10-10 13:13
  • by Jasmin Klipic