What runs GPT-4o and Microsoft Copilot? | Largest AI supercomputer in the cloud | Mark Russinovich
Microsoft has built the world’s largest cloud-based AI supercomputer that is already exponentially bigger than it was just 6 months ago, paving the way for a future with agentic systems.
For example, its AI infrastructure is capable of training and inferencing the most sophisticated large language models at massive scale on Azure. In parallel, Microsoft is also developing some of the most compact small language models with Phi-3, capable of running offline on your mobile phone.
Watch Azure CTO and Microsoft Technical Fellow Mark Russinovich demonstrate this hands-on and go into the mechanics of how Microsoft is able to optimize and deliver performance with its AI infrastructure to run AI workloads of any size efficiently on a global scale.
This includes a look at: how it designs its AI systems to take a modular and scalable approach to running a diverse set of hardware including the latest GPUs from industry leaders as well as Microsoft’s own silicon innovations; the work to develop a common interoperability layer for GPUs and AI accelerators, and its work to develop its own state-of-the-art AI-optimized hardware and software architecture to run its own commercial services like Microsoft Copilot and more.
► QUICK LINKS:
00:00 - AI Supercomputer
01:51 - Azure optimized for inference
02:41 - Small Language Models (SLMs)
03:31 - Phi-3 family of SLMs
05:03 - How to choose between SLM & LLM
06:04 - Large Language Models (LLMs)
07:47 - Our work with Maia
08:52 - Liquid cooled system for AI workloads
09:48 - Sustainability commitments
10:15 - Move between GPUs without rewriting code or building custom kernels. 11:22 - Run the same underlying models and code on Maia silicon
12:30 - Swap LLMs or specialized models with others
13:38 - Fine-tune an LLM
14:15 - Wrap up
► Unfamiliar with Microsoft Mechanics?
As Microsoft's official video series for IT, you can watch and share valuable content and demos of current and upcoming tech from the people who build it at Microsoft.
• Subscribe to our YouTube: https://www.youtube.com/c/MicrosoftMechanicsSeries
• Talk with other IT Pros, join us on the Microsoft Tech Community: https://techcommunity.microsoft.com/t5/microsoft-mechanics-blog/bg-p/MicrosoftMechanicsBlog
• Watch or listen from anywhere, subscribe to our podcast: https://microsoftmechanics.libsyn.com/podcast
► Keep getting this insider knowledge, join us on social:
• Follow us on Twitter: https://twitter.com/MSFTMechanics
• Share knowledge on LinkedIn: https://www.linkedin.com/company/microsoft-mechanics/
• Enjoy us on Instagram: https://www.instagram.com/msftmechanics/
• Loosen up with us on TikTok: https://www.tiktok.com/@msftmechanics
Published on:
Learn more
Made for tech enthusiasts and IT professionals. Expanded coverage of your favorite technologies across Microsoft; including Office, Azure, Windows and Data Platforms. We'll even bring you broader topics such as device innovation with Surface, machine learning, and predictive analytics.
Related posts
Multi‑Agent Workflows in Microsoft Copilot Studio
Microsoft Ignite 2025 in San Francisco was an great experience — energy, innovation, and a huge leap forward for Frontier Firms, Digital Emplo...
Microsoft Copilot (Microsoft 365): Now smarter with visuals: Declarative Agents leverage embedded images for richer, more accurate answers
Declarative Agents have been enhanced to interpret and ground responses using images embedded in files such as Word documents (.docx), PowerPo...
Visualize Dataverse Insights Using Code Interpreter in Microsoft Copilot Studio
Microsoft Copilot Studio is rapidly transforming how business users analyze data, automate tasks, and create intelligent solutions. One of its...
Microsoft Copilot (Microsoft 365): Microsoft 365 Copilot – Chat & Search Browser Extension for Chrome
The Microsoft 365 Copilot browser extension for Chrome brings Copilot Chat and Search directly into the browser, enabling users to ask questio...
AI Agent Security: Applying Presume Breach and Least Privilege in Microsoft Copilot Studio & Power Automate
AI-backed tools are powerful and easy to develop. Give an agent access and clear instructions, and in many cases, it can just do the job. Howe...
Microsoft Copilot Studio – Copy an agent in Copilot Studio Lite to the Copilot Studio full experience
Update: Release of this feature has been updated. We are announcing the ability to copy an agent in Copilot Studio Lite into the Microsoft Cop...
Easily build new Apps and Workflows with Microsoft Copilot
Microsoft have introduced two new Copilot Agents that make it even easier for users to build new Apps and Workflows. What is a Copilot Agent? ...
Microsoft Copilot (Microsoft 365): Ticket Status updates more frequently in ServiceNow Tickets Copilot Connector
ServiceNow Tickets Copilot connector did not use to correctly update the ticket status because of not ingesting inactive tickets. But now, the...
Microsoft Copilot (Microsoft 365): Agent Mode in Excel
Agent Mode lets you build and edit workbooks side by side with Copilot. When you’re updating budgets, creating financial models, or anal...