There was an interesting paper that came out from I think this was pretty small scale. It was at Vanthropic where they experimented with doing the pre training on a human preference data set, as opposed to just kind of, you know random decent quality text off the Internet. Another recent thing that jumped out to me was somebody just published a result where they were able to increase the size of the model progressively throughout training. And it's something like that kind of changing the data set, maybe from the get go seems interesting.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode