Abstracts: September XX, 2023
Members of the research community at Microsoft work continuously to advance their respective fields. Abstracts brings its audience to the cutting edge with them through short, compelling conversations about new and noteworthy achievements.
In this episode, Dr. Xing Xie, a Senior Principal Research Manager at Microsoft Research Asia, joins host Dr. Gretchen Huizinga to discuss “Psychometrics for Evaluating General-Purpose AI.” As AI capabilities move from task-specific to more general-purpose, the paper explores psychometrics, a subfield of psychology, as an alternative to traditional methods for evaluating model performance and for supporting consistent and reliable systems.