Final Update: Tuesday, 02 November 2021 23:33 UTC
We've confirmed that all systems are back to normal with no customer impact as of 11/02, 22:10 UTC. Our logs show the incident started on 11/02, 17:00 UTC and that during the 5 hours and 10 minutes that it took to resolve the issue some customers may be experiencing high latency while accessing metrics and may have experienced failure while receiving alerts.
- Root Cause: The failure was due to a backend dependency becoming unhealthy
- Incident Timeline: 5 Hours & 10 minutes - 11/02, 17:00 UTC through 11/02, 22:10 UTC
We understand that customers rely on Azure Monitor as a critical service and apologize for any impact this incident caused.
-Eric Singleton
Update: Tuesday, 02 November 2021 22:26 UTC
We continue to investigate issues within Azure Monitor. Root cause looks to be due to a dependency which had become overloaded. To address this, currently a rollback is being initiated as a potential mitigation. Some customers may be experiencing high latency while accessing metrics and may have experienced failure while receiving alerts.
- Work Around: None
- Next Update: Before 11/03 00:30 UTC
-Eric Singleton
Update: Tuesday, 02 November 2021 20:45 UTC
We continue to investigate issues within Azure Monitor. Some customers may be experiencing high latency while accessing metrics and may have experienced failure while receiving alerts.
- Work Around: None
- Next Update: Before 11/02 23:00 UTC
-Eric Singleton
Initial Update: Tuesday, 02 November 2021 19:20 UTC
We are aware of issues within Azure Monitor and are actively investigating. Some customers may be experiencing high latency while accessing metrics and may have experienced failure while receiving alerts.
- Work Around: none
- Next Update: Before 11/02 21:30 UTC
We are working hard to resolve this issue and apologize for any inconvenience.
-Eric Singleton