Rohin Shah

TalkRL: The Reinforcement Learning Podcast

CHAPTER

Do You Have a Research Career Plan?

The plan is to train models using human feedback, and then empower the humans providing the feedback as much as we can. Rohin: Knowing everything that the model knows is a pretty high bar, and probably we won't get to it. But there are a bunch of tricks that we can do that get us closer and closer to it. So yes, please do apply if you are interested in working on alignment.

