I've been kicking this question of emergence around from a bunch of different angles as well and trying to just figure out first of all what matters. What really matters for like users society companies is at the end of a training process what can a model do or not do and how general is that capability? That seems like the key thing that like matters most it does seem to be true that you know per that mirage paper that if you find one of these kind of surprises and then rewind and say well okay let me actually measure that performance at every you know increment of the training process then you can plot like a smoother curve. There's this kind of phase change that is often happening between like

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode