Liquid Cooling in Air Cooled Data Centers on Microsoft Azure

With the advent of artificial intelligence and machine learning (AI/ML), hyperscale datacenters are increasingly accommodating AI accelerators at scale, demanding higher power at higher density than is customary in traditionally air-cooled facilities.
As Microsoft continues to expand our growing datacenter fleet to enable the world’s AI transformation, we are faced with a need to develop methods for utilizing air-cooled datacenters to provide liquid cooling capabilities for new AI. Additionally, increasing per-rack-density for AI accelerators necessitates the use of standalone liquid-to-air heat-exchangers to support legacy datacenters that are typically not equipped with the infrastructure to support direct-to-chip (DTC) liquid cooling.
A solution: standalone liquid cooling heat exchanger units.
Microsoft’s Maia 100 platform marked the first introduction of a liquid cooling heat exchanger in existing air-cooled data centers for direct-to-chip liquid cooling. Since that time, we have continued to invest in novel cooling techniques to accommodate newer, more powerful AI/ML processors. Today at OCP 2024, we are sharing contributions for designing advanced liquid cooling heat exchanger units (HXU). By open sourcing our design approach through the Open Compute Project, we hope to share our HXU development work to enable closed-loop liquid cooling in AI datacenters across the entire computing industry.
Heat Exchanger Unit Design Principles
Our designs for HXUs focus on enabling advanced cooling capacity for modern AI processors, improving operating efficiency to reduce power demand, and enabling AI accelerator racks to operate in traditionally air-cooled data centers.
Microsoft’s vision for enhanced effectiveness centers on using the same chilled air that legacy datacenters are already providing for air-cooled platforms. Our engineering spec for HXUs targets the relative liquid and air flow rates required to supply the cooling liquid at the required temperature to the IT equipment.
The design principles for HXUs are the result of a close partnership with Delta and Ingrasys. Working with these partners has helped us evolve our approach, including double-wide rack to increase heat dissipation capacity, and specialized packaging to ensure leak-free transport. Envisioning HXUs with a modular design allows field servicing of key components, including pumps, fans, filters, printed circuit board assembly, and sensors. Quick disconnects and strategically placed leak detection ropes, along with drip pans that guide liquids to the base of an HXU, help mitigate and contain liquid leaks. Fans are placed at the rear to avoid pre-heating within an HXU and eliminate entrainment issues in the cold aisle. The modular fluid connections between HXUs and server racks allow for various configurations.
We welcome further collaboration from the broader OCP community in enabling the future of datacenter power and cooling innovation with state-of-the-art infrastructure engineering capabilities.
Published on:
Learn moreRelated posts
Creating an Agent with Actions in Azure AI Foundry
Azure AI Foundry is an Azure service where you can create agents using various LLMs (including your own). In this post we will look at how to ...
New Test Run Hub in Azure Test Plans
Delivering high-quality software is a necessity and that’s why Azure Test Plans has introduced the all-new Test Run Hub, an enabler for teams ...
Microsoft Teams: New SlimCore-based optimization for Microsoft Teams in VDI – support for MacOS on Citrix and Azure Virtual Desktops/Windows 365
This feature allows MAC endpoints to optimize Microsoft Teams in VDI environments with the new SlimCore-based media engine, providing an expan...
Microsoft Whiteboard: Azure to OneDrive migration progress update
Microsoft Whiteboard storage is migrating from Azure to OneDrive, starting February 2024 and completing by August 2025, with full deprecation ...
Copilot Studio: Azure AI Search Complete Setup Guide
Copilot Studio can use an Azure AI Search index as knowledge to answer Users questions ... The post Copilot Studio: Azure AI Search Complete S...
Microsoft Azure Fundamentals #1: Creating External Tenants in Entra ID: A Step-by-Step Guide
It is important to configure external tenants for different scenarios. In this post we can see how to create a tenant step by step so that it ...
Azure Information Protection: Enable multifactor authentication for your Azure tenant by October 1, 2025
Microsoft will enforce multifactor authentication (MFA) for all Azure resource management actions starting October 1, 2025, with a postponemen...
Azure Automation Custom Runtime Environments
A custom runtime environment is a way of defining a specific job execution environment for Azure Automation runbooks, including Microsoft Grap...
Dynamics 365 Customer Insights – Data – Export your data to Azure Data Lake Storage
We are announcing the general availability of the export to Azure Data Lake Storage (ADLS) feature in Dynamics 365 Customer Insights – Data on...
Dynamics 365 Business Central: Quickly find the Tenant ID, Azure AD Instance, and Tenant Scope from the domain (tenant) name without signing in
Hi, Readers.Today I would like to share another mini tip, how to quickly find the Tenant ID, Azure AD Instance, and Tenant Scope from the doma...