Loading...

AI Testing and Evaluation: Reflections

In the series finale, Amanda Craig Deckard returns to examine what Microsoft has learned about testing as a governance tool. She also explores the roles of rigor, standardization, and interpretability in testing and what’s next for Microsoft’s AI governance work.

Show notes: https://www.microsoft.com/en-us/research/podcast/ai-testing-and-evaluation-reflections/

Published on: July 21, 2025

Learn more

Microsoft Research Podcast

An ongoing series of conversations bringing you right up to the cutting edge of Microsoft Research.

Share post:

More from this blog

Can we AI our way to a more sustainable world?

Doug Burger, sustainability expert Amy Luers, and optimization researcher Ishai Menache examine the ...

Ideas: Steering AI toward the work future we want

Microsoft Chief Scientist Jaime Teevan and researchers Jenna Butler, Jake Hofman, and Rebecca Jansse...

Will machines ever be intelligent?

Are machines truly intelligent? AI researchers Subutai Ahmad and Nicolò Fusi join Doug Burger to com...

Trailer: The Shape of Things to Come

Microsoft research lead Doug Burger introduces his new podcast series, The Shape of Things to Come, ...

Ideas: Community building, machine learning, and the future of AI

As the Women in Machine Learning Workshop (WiML) marks its 20th annual gathering, cofounders, friend...

Ideas: More AI-resilient biosecurity with the Paraphrase Project

Microsoft’s Eric Horvitz and bioscience experts Tessa Alexanian, James Diggans, and Bruce Wittmann d...

Coauthor roundtable: Reflecting on healthcare economics, biomedical research, and medical education

For the series finale, Peter Lee, Carey Goldberg, and Dr. Zak Kohane compare their predictions to in...

Reimagining healthcare delivery and public health with AI

Former Washington State Secretary of Health Dr. Umair Shah and Mayo Clinic CEO Dr. Gianrico Farrugia...

Navigating medical education in the era of generative AI

Next-generation physicians Morgan Cheatham and Daniel Chen discuss how generative AI is transforming...

Relevant topics:

Stay up to date with latest Microsoft Dynamics 365 and Power Platform news!

* Yes, I agree to the privacy policy

AI Testing and Evaluation: Reflections

Related posts