The service in charge of scheduling uptime monitoring experienced a complete outage for 3 hours and 11 minutes. During this period, no uptime monitoring could be performed, affecting all users relying on our service between 16:34 UTC to 19:45 UTC
The issue has been resolved quickly once identified and the root cause identified. A misconfigured step in our deployment step did not correctly restart the service in charge or scheduling uptime monitoring, and our current observability stack did not catch this particular case as an issue.
An upgrade of our internal observability stack has been scheduled to be alerted quickly in case a similar problem arise in the future, has this incident highlighted areas for improvement.
Thank you for your continued trust in our uptime monitoring service.