Loading...

Abstracts: NeurIPS 2024 with Jindong Wang and Steven Euijong Whang

Abstracts: NeurIPS 2024 with Jindong Wang and Steven Euijong Whang

Researcher Jindong Wang and Associate Professor Steven Euijong Whang explore the NeurIPS 2024 work ERBench. ERBench leverages relational databases to create LLM benchmarks that can verify model rationale via keywords in addition to checking answer correctness. 

Read the paper

Get datasets and codes

Published on:

Learn more
Microsoft Research Podcast
Microsoft Research Podcast

An ongoing series of conversations bringing you right up to the cutting edge of Microsoft Research.

Share post:

Related posts

Stay up to date with latest Microsoft Dynamics 365 and Power Platform news!
* Yes, I agree to the privacy policy