@mgorny 💯
I've been advocating this for years, and continue to do so: high uptime should be a monitoring alert condition.
It drives gaining several benefits for your systems and teams:
- normalizes maintenance tasks as part of usual routine
- lowers operational overhead
- enables faster incident response
- encourages regular upgrades
- improves robustness and resiliency
Rebooting a system should not be an exceptional ceremony after hours, it should be a standard operation during work hours.