3min chapter

AXRP - the AI X-risk Research Podcast cover image

22 - Shard Theory with Quintin Pope

AXRP - the AI X-risk Research Podcast

CHAPTER

The Effect of Training Processes on Downstream Behaviors

I think there's like lots of room for scaling up very hands-off behavioral supervision and then once you have that sort of tooling available to you it seems like you can get a much better hand on how different training processes influence downstream behaviors. This high level overview of how all those different training processes change the model's behavior puts you in a much better position for like iterating on how to train models into doing things we want them to do off training distribution inputs okay so that's like what I'm currently working on and who are you working on that with? um three peopleUm so like two of them were Matt's Scholars Roman, Roman, Engler and Owen Dundly Matt Scholars from

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode