BBF: A Better Learning Algorithm Than Rainbow

I absolutely believe that some RL squared or meta RL algorithm could come up with a dramatically more efficient learning algorithm than BBF. The other thing is BBF is really not designed for ultra-hard exploration problems. I would think of BBF as essentially a bigger, better, and faster version of Rainbow. But that means probably no continuous control unless you figure out action space to screen discretization for your problem.

Play episode from 52:53

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app