TalkRL: The Reinforcement Learning Podcast cover image

Karol Hausman and Fei Xia

TalkRL: The Reinforcement Learning Podcast

00:00

How Do You Get a General Language Model to Produce a List of Steps?

In saken, grounding refers to this idea of fordance models. These are that allow you to predict what is the success right of doing a certain task en given a current state. So like if thi human said, how would you get a sponge from the counter and put it in the sink? Then the robot comes back with a list of steps. The second step will repeat its process until it outputs a don token,. which means the entire task sequence is finished.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app