The Inside View cover image

Collin Burns On Discovering Latent Knowledge In Language Models Without Supervision

The Inside View

CHAPTER

Is Blenderbot a Good Benchmark for AGI?

I think I didn't update that much on progress this year because I already had relatively short timelines. So the question is like the time it takes between just like people solving some like toy math problems to like they're being closed to solving like gold medals and then it becomes like I was like a trophy. If we talk about like other benchmarks like apps and MLUYeah I think we've seen the results also on these like did you update at all in the progress this year where you're like I already like with like short timelines.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner