
🚨 Outage Summary
- When: The disruption started around 15:45 UTC on October 29 and was largely mitigated early on October 30, 2025. 
- Cause: Microsoft traced the root cause to an "inadvertent tenant configuration change" within its Azure Front Door (AFD) service, a global content delivery network. - A software defect allowed a faulty deployment to bypass safety validations, leading to an invalid configuration state. 
- Impact: The outage impacted services relying on Azure Front Door for global content delivery, including: - Microsoft Services: Azure Portal access, Microsoft 365 services (Outlook, Teams, Admin Center), Xbox Live, and Minecraft. 
- External Customers: Airlines (like Alaska Airlines), retail companies (like Starbucks and Costco), financial institutions (like NatWest), and others, resulting in issues like website failures, login problems, and check-in disruptions. 
 
🛠️ Resolution
- Microsoft's engineers addressed the issue by blocking all further configuration changes to AFD services and rolling back to a "last known good" configuration across their global network. 
- The company has implemented additional validation and rollback controls and will conduct an internal review (Post Incident Review or PIR), which will be shared with affected customers within 14 days. 
 
 
No comments:
Post a Comment