Azure CycleCloud ‘Learning Circle’ Series for Academic and Industry Customers
Why a 'Learning Circle'?
It is no secret that - since the dawn of time (or thereabouts) - circles have signified unity, safety and equality. Several of our Azure HPC customer contacts from a variety of organisations - both academic and industrial - who work with Azure CycleCloud on a daily basis had expressed an interest in coming together with the engineers and developers to deepen their understanding of this enterprise-friendly tool for orchestrating HPC workloads on Azure: thus the Azure CycleCloud ‘Learning Circle’ workshop series was born.
Run by Microsoft's Azure HPC and AI Product Engineering Teams, these targeted workshop sessions took place in the first 2 weeks of November and covered a number of topics which had been proposed in advance by the session participants. To make the most efficient use of time - and to accommodate interaction between the Americas / EMEA time-zones - it was agreed that it made most sense for the Learning Circle to be run as two shorter sessions as opposed to a day-long event.
Session 1
Taking place on 5th November 2021, the first session comprised an interactive workshop comprising discussion, demonstration and Q&A and which was led by Microsoft’s Andy Howard, Dan Harris and Doug Clayton.
Topics of discussion for Session 1 included an in-depth look at cluster template parameters, the steps to provision a cluster, interactions with the Azure API, customization, resilience, metrics and software updates.
While the session was very much tailored to the specific questions proposed in advanced by the session participants, there was also a large degree of flexibility which was commented on very positively in the customer feedback:
“The flexibility of the staff who attended from the Microsoft side to go off the planned topics and explore specific questions was helpful and, despite this, I felt they were still sensitive to ensuring we covered everything that was wanted by the attendees.”
Session 2
The second session took place the following week on 10th November 2021. This was once again delivered in a similar format but this time with a Slurm focus. Led by Microsoft’s Andy Howard and Ryan Hamel, the presenters were also pleased to welcome Nick Ihli, Director of Sales Engineering at SchedMD.
Starting with a Session 1 recap, the participants then had an opportunity to follow up with additional questions which had arisen over the course of the week.
Given that the same week had seen the public announcement of the HBv3 Milan-X Preview this was also a talking-point, with discussion including how the newly-announced HBv3 VMs enhanced with AMD EPYC 3rd Gen processors with 3D v-cache (codenamed “Milan-X”) could be integrated seamlessly into the customers’ production systems (the good news was confirmed - that as soon as the Milan-X CPU appear in the APIs they will be available in CycleCloud, as Azure API is updated on a daily basis).
The session also went in-depth to cover CycleCloud’s management of Slurm configurations & parameters, with instructor Ryan showing and advanced preview of the upcoming new release of CC 8.2.1 (launched 12th November 2021) and articulating some of the main changes, including the ability to now do Slurm job accounting (see Slurm Cluster Updates) and Improved Cost Tracking which now shows approximate ongoing cluster costs and provides a REST API for fetching cost data programmatically.
Conclusion & Feedback
In summary this ‘Learning Circle’ series was one of the first of its kind, bringing together Microsoft software engineers and developers, Azure HPC specialists, Microsoft partner SchedMD and, most importantly providing an opportunity for some of our Azure HPC customers who have production HPC systems running in Azure to learn from each other and share ideas and insights.
Feedback from the sessions indicated that they had provided a positive and useful experience for all involved:
“Both sessions were useful and informative. As well as being able to speak directly with the technical staff, doing so in a forum of peers who were also using CycleCloud with SLURM was extremely valuable as we discussed questions and challenges that affect us all as well as sharing ideas.”
“Would love to see another deep dive session on CycleCloud in the near future!!”
“Very useful and a valuable use of time; friendly, relaxed and respectful; well organised and managed by our hosts.”
It is not often we have the privilege to run a session comprising Azure HPC customers from such a broad range of industry verticals - all of whom are supporting different HPC end-users and different scientific applications. So providing a 'Learning Circle' forum, allowing open discussion and Q&A between peers and subject-matter experts from different organisations provides a great opportunity for shared learning and development.
The Azure HPC team looks forward very much to running more of these deep dive sessions in the future.
Links & Resources
Current Release Notes - Azure CycleCloud 8.2.x - Azure CycleCloud | Microsoft Docs
Azure CycleCloud Documentation - Azure CycleCloud | Microsoft Docs
Published on:
Learn moreRelated posts
This Month in Azure Static Web Apps | 09/2024
We are back with another edition of the Azure Static Web Apps Community! :party_popper: September was yet another month ...
IPv6 Adoption: Enhancing Azure WAF on Front Door
The transition to IPv6 is a significant step for enterprise corporations, reflecting the evolution of internet technology and the need for a l...
Introducing the Data-Bound Reference Layer in Azure Maps Visual for Power BI
Imagine managing a nationwide sales team and needing to understand how your sales align with factors like population density, competitor locat...
GitHub Copilot for Azure: 6 Must-Try Features
As developers, we are constantly seeking tools that streamline our workflows and boost productivity. … Enter GitHub Copilot for Azure, now in ...
Unlocking the Best of Azure with AzureRM and AzAPI Providers
With the recent release of AzAPI 2.0, Azure offers two powerful Terraform providers to meet your infrastructure needs: AzureRM and AzAPI. The ...
Azure Communication Services Ideas Board: Share your feedback with the product team
Innovation is not a solitary pursuit, and we recognize that some of the best ideas come from you, our Azure Communication Services community. ...
Engage with the Azure Community Services Ideas Board: Your Voice Matters
Innovation is not a solitary pursuit, and we recognize that some of the best ideas come from you, our Azure Communication Services community. ...
Optimizing custom copilot (agent) performance with Azure Load Testing: A comprehensive guide
As we move into the next phase of digital transformation, the role of custom copilots is set to become increasingly pivotal. By leveragin...
Azure Storage - TLS 1.0 and 1.1 retirement
Overview TLS 1.0 and 1.1 retirement on Azure Storage was previously announced for Nov 1st, 2024, and it was postponed recently to 1 year later...