Loading...

[Mitigated] Azure Lab Services - Lab Plan Outage

[Mitigated] Azure Lab Services - Lab Plan Outage

Azure Lab Service is experiencing an outage that is affecting Lab Plans, but not Lab Accounts. This outage intermittently impacts all operations in the following regions:

  • Australia East
  • East US
  • North Europe
  • South Central US
  • Southeast Asia
  • UAE North
  • UK South
  • West Europe

Impacted customers are encouraged to use unaffected regions as a workaround.  We apologize for any inconvenience this may cause.

 

Update 5/13: A potential hotfix is being tested.  We also have temporarily disabled lab schedules which means that VMs will not automatically start/stop based on schedules.  Please refer back to this blog post for updates.

 

Update 5/14 (8:00 AM Central): The engineering team is still working on a hotfix and validating that this addresses the issue before rolling the fix out incrementally.

 

Update 5/14 (12:00 PM Central): The root-cause has been confirmed and further testing of the hotfix is expected to be completed within the next 1-2 hours in preparation for a widespread roll-out.  We expect the hotfix to be rolled out shortly after that.  If you have a tight timeline and are currently unable to use labs, we still recommend recreating labs in an unimpacted region if possible.

 

Update 5/14 (4:00 PM Central): The engineering team has completed testing the hotfix and verified that it addresses the underlying issue causing the outage.  They are in process of rolling out the hotfix first to Southeast Asia which is one of the impacted regions.  Within the next few hours, we'll provide an update when you can expect the hotfix to deployed to the other impacted regions.

 

Update 5/15 (1:00 AM Central):  The initial hotfix has been deployed and although it addressed the underlying issue, the regional processing isn't recovering as expected.  Upon further investigation there was an additional underlying issue uncovered which is slowing down processing the backlog of operations.  The engineering team is actively working on creating a new hotfix for the underlying issue.

 

Update 5/15 (9:00 PM Central):  We recognize the frustration and inconvenience this outage is causing for our customers with labs in the impacted regions, and we sincerely apologize. We have made positive progress in our investigation and validated that the outage no longer exists in the following regions - however, you may see slower lab creation and start/stop VM performance:

  • Australia East
  • Australia Southeast
  • Brazil South
  • Canada Central
  • Canada East
  • Central India
  • Central US
  • East Asia
  • East US
  • France Central
  • Germany West Central
  • Japan East
  • Korea Central
  • North Central US
  • North Europe
  • Norway East
  • South Africa North
  • Southeast Asia
  • Switzerland North
  • UAE North
  • UK South
  • UK West
  • West Central US
  • West Europe
  • West US

For the remaining impacted regions, please know that we have escalated the matter and several engineering teams are working diligently to explore mitigation options. For transparency, we anticipate that the investigation and resolution may take a one additional business day for the following regions that are still impacted:

  • East US 2
  • South Central US

 

Update 5/16 (8pm Central): All regions are running and processing as expected including SouthCentralUS and EastUS2 (mentioned above). We have also confirmed that EastUS is also processing jobs as expected (there were confirmed slowdowns earlier today). One side effect of the outage will be failed operations - you may see VMs showing failures to start, labs failing to be created, labs failing to publish, etc. Please retry those operations. If you encounter any additional issues in any region with Azure Lab Services, please open an Azure support ticket for us to investigate.

 

Update 5/16 (9pm Central):  One last update, we wanted to let you know that schedules have been re-enabled in all regions.  Please open an Azure support ticket if you see any issues!

 

Update 5/20 (3pm Central):  We wanted to post an update to let you know that we've received several Azure support tickets relating to slowdowns in the EastUS region.  This appears to be sporadic (not everyone encounters slowdowns) and the engineering team is actively investigating.  The best route for support is opening an Azure support ticket - we will provide an update back here once the slowdowns are resolved.


Update 5/22 (11am Central):  The engineering team continues to investigate the sporadic slow operations in the platform.  If you have any stuck operations (virtual machines are stuck starting, stuck stopping, etc) or you see very slow operations (more than 15min), the best next step is to notify the team by opening an Azure support ticket.

 

Update 5/23 (6pm Central): The engineering team has identified scalability issues resulting in intermittent slow operations with the service hardware. We are now conducting internal tests with upgraded hardware and expect to implement the changes by 5/31.

Published on:

Learn more
Azure Lab Services articles
Azure Lab Services articles

Azure Lab Services articles

Share post:

Related posts

Azure Elastic SAN for Azure VMware Solution: now Generally Available

Have you been looking to expand your storage on Azure VMware Solution (AVS), but do not need the extra compute performance and the associated ...

9 hours ago

Introducing Pull Request Annotation for CodeQL and Dependency Scanning in GitHub Advanced Security for Azure DevOps

In the world of software development, security is paramount. As developers, we strive to write clean, efficient, and most importantly, secure ...

1 day ago

Accelerate metadata heavy workloads with Metadata Caching preview for Azure Premium Files SMB & REST

Azure Files previously announced the limited preview of Metadata caching highlighting improvements on the metadata latency (up to 55...

2 days ago

How to Choose the Right Models for Your Apps | Azure AI

With more than 1700 models to choose from on Azure, selecting the right one is key to enabling the right capabilities, at the right price poin...

2 days ago

MMR Call Redirection for Azure Virtual Desktop, Windows 365 now available

Today, I am pleased to share the launch of Multimedia Redirection (MMR) Call Redirection for Azure Virtual Desktop and Windows 365. Call Redir...

2 days ago

Liquid Cooling in Air Cooled Data Centers on Microsoft Azure

With the advent of artificial intelligence and machine learning (AI/ML), hyperscale datacenters are increasingly accommodating AI accelerators...

2 days ago

Introducing Azure Product Retirement Livestreams

The Azure Retirements team, in collaboration with key partner groups, is excited t...

2 days ago

Azure Developer CLI (azd) – October 2024

This post announces the October release of the Azure Developer CLI (`azd`), including configurable api-version for ACA. The post Azure Develop...

3 days ago

[Solved] Azure Function is not showing in the List in Azure Function App in Portal after Published from Visual Studio

While working with Azure Function and Publishing to Azure, you may find that your function gets published from Visual studio but your function...

3 days ago
Stay up to date with latest Microsoft Dynamics 365 and Power Platform news!
* Yes, I agree to the privacy policy