Latent Space: The AI Engineer Podcast cover image

Latent Space: The AI Engineer Podcast

ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt)

Jun 10, 2024
Expert guests Graham Neubig and Aman Sanger discuss AI topics like Code Edits, Sandboxes, Academia vs Industry. They delve into Benchmarks like SWEBench, Dataset Contamination Detection, and GAIA Benchmark. The conversation also touches on Reasoning - Self-RAG, Let's Verify Step By Step, and developments in multi-agent systems with MetaGPT.
04:29:19

Podcast summary created with Snipd AI

Quick takeaways

  • Academic industry collaboration in language models needs improvement for inclusive research environment.
  • Benchmark evolution from DARPA to polymorphic era requires unified community effort for standardization.

Deep dives

Evolution of Language Models and Industry-Academia Collaboration

The changing role of academia from leading research to competing with industry in developing large language models is notable. The collaboration and acknowledgment of academic works by industry, particularly in the field of language models, needs improvement to foster a more inclusive and recognized research environment.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner