Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Last revisionBoth sides next revision | ||
public:stopdayactivities_4dec2018 [2018-12-05 10:35] – [Aartfaac] grit | public:stopdayactivities_4dec2018 [2018-12-13 13:32] – [Review meeting] grit | ||
---|---|---|---|
Line 141: | Line 141: | ||
==== Review meeting ==== | ==== Review meeting ==== | ||
- | * | + | When: 13-12-2018 11:00 Muller |
+ | Present: Arno, Matthijs(Slack), | ||
+ | |||
+ | |||
+ | - Central: Always start the stop-day with NIS updates. NIS is needed for Slurm and Lexars (ssh tunnel) | ||
+ | - Cep3: | ||
+ | - CEP4: | ||
+ | - Lexars: The lexars came up when NIS was down. This caused the ssh-tunnel to be broken. | ||
+ | - LCU family: Replacement of Rubidium in CN caused confusion and that took quite some time to figure out what was happening. | ||
+ | - LCU ILT: Not available on stop-day. They were rebooted the next Monday. In future: plan many months in advance, if possible every other stopday. | ||
+ | - Wincc: We need to set up a testsystem first. | ||
+ | - About Novell IDM: What is the timescale for replacement? | ||
+ | - Ldb003: Intel NIC did not work. We used internal NIC’s instead. | ||
+ | - Lofarlta01 not done. No time left. | ||
+ | - Dragnet: Are dragnet users aware of the cable change? SOS have skipped the tests, so the Dragnet team needs to do that. Mattijs will inform dragnet@astron.nl. | ||
+ | - Aartfaac-lcu was upgraded and rebooted the next Wednesday. It had over 500 packages to be installed. It went fine in the end. | ||
+ | - Network: | ||
+ | - The Zabbix server crashed during upgrade. Reinstall was needed and that took some hours. Cause unknown. | ||
+ | - Dwingeloo systems were also updated & rebooted using spacewalk. | ||
+ | - Scu001 has no NFS mount. Remove it from SDOS checks. | ||
+ | - Triggered observation test failed. There seems to be a bug in the script. | ||
+ | - Matthijs: Cep3. Should SOS inform users? Yes, the coordinator will report when accounts are back and Slurm is up. SOS needs to check and inform users thereafter. The SOS checks are still being defined. | ||
+ | - Network overhaul. Validation run in front of stop-day was not done due to miscommunication. Please wait for acknowledgement. |