Last Week in AI cover image

#134 - Text-to-Speech, Gartner Hype Cycle, AI2 OLMo, AlphaStar Unplugged, China Regulations, AI Porn Marketplace

Last Week in AI

00:00

Generative assessment project: Comparing language models on hallucinations and answering behavior

OpenAI unveiled the generative assessment project, a research initiative to explore strengths and weaknesses of language models. They compare how different models hallucinate answers and hedge instead of giving a direct answer. It's interesting because it's a unique comparison that hasn't been done before, especially with proprietary models.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app