
#134 - Text-to-Speech, Gartner Hype Cycle, AI2 OLMo, AlphaStar Unplugged, China Regulations, AI Porn Marketplace
Last Week in AI
Generative Assessment Project: Comparing language models on hallucinations and answering behavior
OpenAI unveiled the Generative Assessment Project, a research initiative exploring the strengths and weaknesses of language models. It compares how often different models hallucinate answers and how often they hedge instead of answering directly. The comparison is interesting because it has not been done before, especially across proprietary models.