
Rohin Shah
TalkRL: The Reinforcement Learning Podcast
Do You Have a Research Career Plan?
The plan is to train models using human feedback, and then like mpower tthe humans providing the feedback as much as he can. Roan: Knowing everything that the model knows is a pretty high bar, and probably we won't get to it. But they are like a bunch of tricks that we can do that get us closer and closer to it. So ye, please do apply if you are interested in working on the a alinment.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.