Learning the Doubt Regarding Humans

The first step for me was to actually understand where the problem is. There's nothing that asks you to do episodic research. A lot of reinforcement learning formalism uses these infinite-horizon settings. So in principle, all the algorithms should work without ..., but when you actually use them, they're complete garbage.

