Episode 504 - Azure Reliability SRE

Sadaf Khan joins Evan and Russell to discuss Service Reliability Engineering in the Azure engineering group.

Media file: https://azpodcast.blob.core.windows.net/episodes/Episode504.mp3

YouTube: https://www.youtube.com/watch?v=QNGdTnb1W90&t=1684s

Key Topics:

  • Azure Reliability SRE: Evan introduced the episode's focus on Azure reliability SRE and mentioned a special guest, Sadaf, who would provide insights on the topic. 0:19
  • Azure Storage Public Preview Feature: Russell discussed a new public preview feature for Azure Storage that allows customers to manage planned failovers, enhancing the service's reliability. 1:10
  • Virtual Machine Scale Set Update: Russell highlighted an update to virtual machine scale sets that allows mixing different instances, improving flexibility and scalability. 1:38
  • Azure API Management Workspace: Russell introduced a new feature in Azure API Management that gives teams more autonomy in managing and publishing APIs. 2:08
  • NetApp Files Storage Update: Russell mentioned the general availability of cool access for Azure NetApp Files, allowing for more cost-effective data storage based on access patterns. 2:40
  • Redis Cache Update: Russell discussed a new tier for Redis Cache that supports larger enterprises with increased memory and compute capabilities. 3:02
  • Azure Red Hat OpenShift Update: Russell shared an update on Azure Red Hat OpenShift, which now supports up to 250 nodes, significantly increasing scalability. 3:29
  • SRE Role and Impact: Sadaf explained the role of SRE in improving service reliability and quality, detailing their engagement model with various Azure services. 4:52
  • SRE Engagement and Resistance: Sadaf shared insights on the initial resistance faced from service teams during SRE engagements and how trust is built over time to allow for more impactful changes. 7:49
  • SRE's Approach to Service Improvement: Sadaf outlined the SRE team's structured approach to service improvement, focusing on fundamentals, service health, operational efficiency, and scalability. 10:51
  • AI Initiatives in SRE: Sadaf discussed the SRE team's initiatives in leveraging AI to analyze incident data and generate insights, aiming to reduce the cognitive load on engineers. 30:27

The Azure Podcast
Short podcasts on various topics related to the Microsoft Cloud platform.
