What runs GPT-4o and Microsoft Copilot? | Largest AI supercomputer in the cloud | Mark Russinovich
Microsoft has built the world’s largest cloud-based AI supercomputer that is already exponentially bigger than it was just 6 months ago, paving the way for a future with agentic systems.
For example, its AI infrastructure is capable of training and inferencing the most sophisticated large language models at massive scale on Azure. In parallel, Microsoft is also developing some of the most compact small language models with Phi-3, capable of running offline on your mobile phone.
Watch Azure CTO and Microsoft Technical Fellow Mark Russinovich demonstrate this hands-on and go into the mechanics of how Microsoft is able to optimize and deliver performance with its AI infrastructure to run AI workloads of any size efficiently on a global scale.
This includes a look at how Microsoft designs its AI systems with a modular, scalable approach to run a diverse set of hardware, including the latest GPUs from industry leaders and Microsoft’s own silicon; its work on a common interoperability layer for GPUs and AI accelerators; and its state-of-the-art AI-optimized hardware and software architecture, which runs its own commercial services such as Microsoft Copilot.
► QUICK LINKS:
 00:00 - AI Supercomputer
 01:51 - Azure optimized for inference
 02:41 - Small Language Models (SLMs)
 03:31 - Phi-3 family of SLMs
 05:03 - How to choose between SLM & LLM
 06:04 - Large Language Models (LLMs)
 07:47 - Our work with Maia
 08:52 - Liquid cooled system for AI workloads
 09:48 - Sustainability commitments 
 10:15 - Move between GPUs without rewriting code or building custom kernels
 11:22 - Run the same underlying models and code on Maia silicon
 12:30 - Swap LLMs or specialized models with others
13:38 - Fine-tune an LLM
 14:15 - Wrap up
► Unfamiliar with Microsoft Mechanics?
Microsoft Mechanics is Microsoft's official video series for IT. You can watch and share valuable content and demos of current and upcoming tech from the people who build it at Microsoft.
• Subscribe to our YouTube: https://www.youtube.com/c/MicrosoftMechanicsSeries
• Talk with other IT Pros, join us on the Microsoft Tech Community: https://techcommunity.microsoft.com/t5/microsoft-mechanics-blog/bg-p/MicrosoftMechanicsBlog
• Watch or listen from anywhere, subscribe to our podcast: https://microsoftmechanics.libsyn.com/podcast
► Keep getting this insider knowledge, join us on social:
• Follow us on Twitter: https://twitter.com/MSFTMechanics
• Share knowledge on LinkedIn: https://www.linkedin.com/company/microsoft-mechanics/
• Enjoy us on Instagram: https://www.instagram.com/msftmechanics/
• Loosen up with us on TikTok: https://www.tiktok.com/@msftmechanics
