The client currently has several hundred servers in use in production. There are a few monitoring options as yet, but these react too late or do not match the client’s requirements.
The server landscape is supposed to be monitored with a monitoring solution (Nagios). Some additional checks are created (DB checks, PowerShell scripts, etc.)
Specific action steps are defined and the system is monitored 24/7. Therefore, it is possible to react quickly if an error occurs.
Thanks to the 24/7 monitoring and the rapid response, many downtimes could have been prevented before they even occurred. The client can count on a quick and reliable error recovery.
Downtimes due to errors have been reduced massively. Thanks to the possibility to develop individual checks, even complex errors can be prevented at an early stage.