2min chapter

The Thesis Review cover image

[11] Jacob Andreas - Learning from Language

The Thesis Review

CHAPTER

How to Beat Monazoom Is Revenge From Scratch

For the plan representations that were the sketch representations that we were using in the policy sketches paper there wasn't even that much structure. Really just I have a sequence of sub tasks the sub tasks have names the names tell you which tasks share sub tasks. And so the point there is just that this is an extremely simple kind of annotation to gather in addition to whatever sort of work you were building doing to build your reinforcement learning agent in the first place. So if I can you know spend. Three weeks tuning hyper parameters and it takes me five minutes to write down a text file that contains a little bit of information extra information about the structure of my problems. What can I actually do with the kind

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode