Loading...

Episode 504 - Azure Reliability SRE

Episode 504 - Azure Reliability SRE

Sadaf Khan joins Evan and Russell to explain and talk about Service Reliability Engineering in the Azure engineering group.

 

Media file: https://azpodcast.blob.core.windows.net/episodes/Episode504.mp3

YouTube: https://www.youtube.com/watch?v=QNGdTnb1W90&t=1684s

 

Key Topics:

  • Azure Reliability SRE: Evan introduced the episode's focus on Azure reliability SRE and mentioned a special guest, Sadaf, who would provide insights on the topic. 0:19
  • Azure Storage Public Preview Feature: Russell discussed a new public preview feature for Azure storage that allows customers to manage planned failovers, enhancing the service's reliability. 1:10
  • Virtual Machine Scale Set Update: Russell highlighted an update to virtual machine scale sets that allows mixing different instances, improving flexibility and scalability. 1:38
  • Azure API Management Workspace: Russell introduced a new feature in Azure API management that enables teams to have more autonomy in managing and publishing APIs. 2:08
  • NetApp Files Storage Update: Russell mentioned the general availability of cool access for NetApp files storage, allowing for more cost-effective data storage based on access patterns. 2:40
  • Redis Cache Update: Russell discussed a new tier for Redis Cache that supports larger enterprises with increased memory and compute capabilities. 3:02
  • Azure Red Hat Openshift Update: Russell shared an update on Azure Red Hat Openshift, which now supports up to 250 nodes, significantly increasing scalability. 3:29
  • SRE Role and Impact: Sadaf explained the role of SRE in improving service reliability and quality, detailing their engagement model with various Azure services. 4:52
  • SRE Engagement and Resistance: Sadaf shared insights on the initial resistance faced from service teams during SRE engagements and how trust is built over time to allow for more impactful changes. 7:49
  • SRE's Approach to Service Improvement: Sadaf outlined the SRE team's structured approach to service improvement, focusing on fundamentals, service health, operational efficiency, and scalability. 10:51
  • AI Initiatives in SRE: Sadaf discussed the SRE team's initiatives in leveraging AI to analyze incident data and generate insights, aiming to reduce the cognitive load on engineers. 30:27

Published on:

Learn more
The Azure Podcast
The Azure Podcast

Short podcasts on various topics related to the Microsoft Cloud platform.

Share post:

Related posts

Announcing: Dynamic Data Masking for Azure Cosmos DB (Preview)

Today marks a big step forward with the public preview of Dynamic Data Masking (DDM) for Azure Cosmos DB. This feature helps organizations pro...

6 hours ago

Use Azure SRE Agent with Azure Cosmos DB: Smarter Diagnostics for Your Applications

We’re excited to announce the Azure Cosmos DB SRE Agent built on Azure SRE Agent; a new capability designed to simplify troubleshooting and im...

6 hours ago

General Availability: Priority-Based Execution in Azure Cosmos DB

Have you ever faced a situation where two different workloads share the same container, and one ends up slowing down the other? This is a comm...

6 hours ago

Announcing Preview of Online Copy Jobs in Azure Cosmos DB: Migrate Data with Minimal Downtime!

We are excited to announce the preview of Online Copy Jobs, a powerful new feature designed to make data migration between containers seamless...

6 hours ago

Azure Developer CLI (azd) Nov 2025 – Container Apps (GA), Layered Provisioning (Beta), Extension Framework, and Aspire 13

This post announces the November release of the Azure Developer CLI (`azd`). The post Azure Developer CLI (azd) Nov 2025 – Container App...

1 day ago

Announced at Ignite 2025: Azure DocumentDB, MCP Toolkit, Fleet Analytics, and more!

Microsoft Ignite 2025 kicked off with a wave of announcements for Azure Cosmos DB and Azure DocumentDB, setting the tone for a week of innovat...

1 day ago

Automating Microsoft Fabric Workspace Creation with Azure DevOps Pipelines

In today’s fast-paced analytics landscape, Microsoft Fabric has become the leader of enterprise BI implementations, one of the fundamental con...

1 day ago

New T-SQL AI Features are now in Public Preview for Azure SQL and SQL database in Microsoft Fabric

At the start of this year, we released a new set of T-SQL AI features for embedding your relational data for AI applications. Today, we have b...

2 days ago
Stay up to date with latest Microsoft Dynamics 365 and Power Platform news!
* Yes, I agree to the privacy policy