TalkRL: The Reinforcement Learning Podcast cover image

Karol Hausman and Fei Xia

TalkRL: The Reinforcement Learning Podcast

00:00

Using the General Knowledge in the Language Model

Saken is just the very first step showing how you can do, how you can make these open l plans. They're very myopic. Saken to realize that, well, i failed. I should probably try it again, or you don't try to fix it. Somehow, some of the, some of these things, we are starting to work on how to make it little less myopic,. and can have optimized all the steps so that you can think about your plan more halistically.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app