TalkRL: The Reinforcement Learning Podcast cover image

Rohin Shah

TalkRL: The Reinforcement Learning Podcast

00:00

The Top Two Approaches

Team cairos used 80 thousand plus labelled images and built some very specific components for this. Team obsidian produced this inverse cue learning method which has seemed like more general, theoretical solution. Even even the top tame did rely on a behavior cloned navigation policy that used the neral network. It shows you if you're just actually trying to get good performance, do you put inor or put in domain knowledge? And how much domain knowledge do you putting in? And a how do you do it?

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app